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(57) Abstract: The present invention relates 
to inhibitors of coronaviruses including severe 
acute respiratory syndrome (SARS); methods of 
identifying inhibitors of coronaviruses; methods 
and compositions useful lor therapy (treatment, 
or prevention) of coronavirus infections. The 
coronavirus inhibitors target different regions of 
the coronavirus Spike protein to inhibit coronavirus 
replication. In particular embodiments, the 
inhibitors include Five-Helix proteins, HR2 
peptides, KR1 peptides. HR1 chimeric peptides. 
The present invention also features retro -inverso 
analogue of the coronavirus inhibitors, coronavirus 
inhibitors containing one or more tryptophan 
analogs, and / or one or more lactam bridges. 
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TITLE OF THE INVENTION 
INHIBITORS OF CORONAVIRUS 

CROSS-REFERENCE TO RELATED APPLICATIONS 

The present application claims benefit of U.S. provisional application 60/463,100 filed April 14, 
2003, U.S. provisional application 60/467,190 filed April 30, 2003, U.S. provisional application 
60/478,860 filed June 16, 2003, U.S. provisional application 60/479,429 filed June 18, 2003, and U.S. 
provisional application 60/479,430 filed June 18, 2003. All the applications are incorporated by 
reference. 

BACKGROUND OF THE INVENTION 

Coronaviruses are nonsegmented, s/s plus strand, enveloped RNA viruses that infect a variety of 
mammals and birds. In humans, they cause respiratory infections, enteric infections and possibly 
neurological conditions. Typically, infection results in a mild, self "limited condition, such as a cold or 
upset stomach. Recently, there have been reports that severe acute respiratory syndrome (SARS) is 
caused by a coronavirus, SARS-CoV. Although most SARS victims recover, the World Health 
Organization estimates that SARS is fatal in about 4% of cases, particularly in individuals with an 
underlying condition, such as a weakened immune response, diabetes or heart disease. 

Coronavirus envelope protein generally carries two glycoproteins, M or membrane glycoprotein 
and S or spike protein. S protein is involved in receptor binding and cell fusion and is comprised of two 
subunits: the N-teimmal subunit, called SI, which forms the knob-like component of the spike and the C- 
terminal subunit, called S2, which forms the stem of the spike. Little is known about the mechanism by 
which coronaviruses enter cells and presently there are no antiviral drugs consistently successful in 
treating any coronavjrus infection, including SARS, or vaccines available to prevent infection. 

The references cited herein are not admitted to be prior art to the claimed invention. 

SUMMARY OF THE INVENTION 

This invention relates to inhibitors of coronaviruses, such as SARS-CoV; methods of identifying 
inhibitors of coronaviruses; methods and compositions useful for therapy (treatment, or prevention) of 
coronavirus infections, including methods and compositions for therapy of severe acute respiratory 
syndrome (SARS); and compositions for use as vaccines and immunogens and methods in which they are 
used to vaccinate or immunize individuals against coronaviruses, including vaccines and immunogens for 
use in vaccinating or immunizing individuals against SARS. 
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In particular embodiments, the present invention relates to C-peptides, N-peptides, Five-Helix 
proteins, Heptad Repeat 2 (HR2) peptides, and Heptad Repeat 1 (HR1) coiled-coil peptides, all of which 
target HR1 or HR2 regions of the coronavirus Spike protein to inhibit coronavirus replication. They are 
for use as inhibitors of coronaviruses in therapy (treatment, or prevention) of coronavirus infection, 
5 including for use as vaccines or immunogens. The exact HR1 and HR2 regions may differ from the 

proposed regions. However, the proposed regions provided herein are sufficient to enable the design of 
the inhibitors. 

Reference to "comprising" and "comprises" throughout the claim indicates additional groups may 
be present. For example, a C-peptide comprising about 18 amino acids of SEQ ID NO: 1 may have 
10 additional groups such as arriino acid regions from SEQ ID NO: 1 or other types of groups. 

Reference to "peptide" in a C-peptide and N-peptide indicates amino acids joined together by a 
peptide bond and salt forms. The amino acids may be naturally occurring amino acids or non-naturally 
occurring amino acids. A non-naturally occurring amino acid (also referred to as a "modified" amino 
acid) contains a non-naturally occurring R-group. While "peptide" does not provide a size limitation, 
15 smaller length peptides are generally preferred. Smaller length peptides may offer advantages in ease of 
synthesis and solubility. 

Standard one and three letter codes for amino acids are as follows: A=Ala=Alanine: 
C=Cys=Cysteine: D=Asp=Aspartic acid: E=Glu=Glutamic acid: F=Phe=Phenyl alanine: G=Gly=Glycine: 
H=His=Histidine: I=Ile=Isoleucine: K=Lys=Lysine: L=Leu==Leucine: M=Met=Methionine: 
20 N=Asn=Asparagine: P=Pro=Proline: Q=Gln=Glutamine: R=Arg=Arginine: S=Ser=Serine: 
T=Thr=Threonine: V=Val=Valine: W=Trp=Tryptophan: and Y=Tyr=Tyrosine. 

According to an embodiment of the present invention, the peptides or proteins can be modified to 
contain conservative substitutions. Conservative substitutions for amino acids are shown in Table 1. 
Factors that can be taken into account for a conservative substitution include arnino acid size, charge, 
25 polarity, and hydrophobicity, (See, for example, Ausubel, Current Protocols in Molecular Biology, John 
Wiley, 1987-2001, Appendix 1C.) Conservative substitutions, if present, should substantially maintain or 
enhance either coronavirus inhibition properties or pharmacokinetic properties. In different 
embodiments, there are 0, 1, 2, 3, 4, 5, or 6-10 conservative substitutions. 



Table 1: Conservative Substitutions 



Arnino Acid 


Conservative Substitution 


Alanine 


D-Alanine, Glycine 


Arginine 


D-Arginine, Lysine, D-Lysine 
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Table 1: Conservative Substitutions 



Amino Acid 


Conservative Substitution 


Asoaraeine 


D-Asparagine, Glutamine, D-Glutarnine 


Aspartic acid 


D-Aspartic acid, Glutamic acid, D-Glutamic acid 


Cysteine 


D-Cysteine 


Glutamic acid 


D-Glutamic acid, Aspartic acid, D-Aspartic acid 


Glut amine 


D-Glutarnine, Asparagine, D-Asparagine 


Histidine 


D-Histidine 


Isoleucine 


D-Isoleucine, Valine, D-Valine, Leucine, D-Leucine . 


Leucine 


D -Leucine, Valine, D-Valine, Isoleucine, D-Isoleucine 


Lysine 


D-Lysine, Ornithine, D-Oraithine, 


Methionine 


D-Methionine, Norleucine, D-Norleucine 


Phenylalanine 


D-Phenylalanine, Tyrosine, D-Tyrosine 


Proline 


D-Proline 


Serine 


D-Serine, Threonine, D-Threonine 


Threonine 


D-Threonine, Serine, D-Serine 


Tryptophan 


D-Tryptophan, Tryptophan analog* 


Tyrosine 


D-Tyrosine, Phenylalanine, D-Phenyl alanine 


Valine 


D-Valine, Leucine, D- Leucine, Isoleucine, D-Isoleucine 



* The tryptophan analog can be D or L. 



According to an embodiment of the present invention, the peptides can be modified, e.g., 
containing one or more tryptophan analogs and / or one or more lactam bridges. 
5 The present invention also provides retro-inverso analogues of the inhibitors of coronavirus. 

As used herein, polypeptide refers to a peptide comprising about 10 to about 500 amino 
acids. Examples of polypeptides include C-peptides, N-peptides, the chimeric N-peptides, Five Helix, 
Six Helix, and their derivatives. 

According to an embodiment of the present invention, the coronavirus inhibitor is a purified 
10 polypeptide. A "purified polypeptide n represents at least 10% of the total polypeptide present in a sample 
or preparation. In preferred embodiments, the purified polypeptide represents at least about 50%, at least 
about 75%, or at least about 95% of the total protein in a sample or preparation. Reference to "purified 
polypeptide" does not require that the polypeptide has undergone any purification and may include, for 
example, chemically synthesized polypeptide that has not been purified. According to another 
15 embodiment of the present invention, the coronavirus inhibitor is a purified oligomer of polypeptide. 
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One aspect of the present invention describes a method for treating a patient to inhibit 
coronavirus replication. The method comprises the step of administering to the patient an effective 
amount of an inhibitor of coronavirus to the patient. 

Another aspect bf the present invention describes a method of inhibiting the ability of a human 
5 coronavirus to infect a cell in vitro. The method comprises the step of providing an inhibitor of 
coronavirus to a cell culture infected with the coronavirus. 

Reference to "patient" indicates a mammal undergoing treatment and does not necessarily 
indicate the presence of a coronavirus infection. Treatment can be provided to a patient infected with the 
coronavirus to decrease the severity of a coronavirus infection. Treatment can also be provided 
10 prophylactically to reduce the likelihood or severity of a coronavirus infection. A patient can be a human 
or another type of mammal. Treatment of non-human patients may be useful in protecting pets and 
livestock, or in evaluating the efficacy of a particular treatment. 

Reference to "effective amount" indicates an amount sufficient to provide a beneficial effect to a 
patient. Beneficial effects that can be achieved include detectable inhibition of viral replication or 
15 decrease in viral load. The inhibition or decrease should be sufficient to provide a medically significant 
decrease in the severity or likelihood of a coronavirus infection. 

Other features and advantages of the present invention are apparent from the additional 
descriptions provided herein including the different examples. The provided examples illustrate different 
components and methodology useful in practicing the present invention. The examples do not limit the 
20 claimed invention. Based on the present disclosure the skilled artisan can identify and employ other 
components and methodology useful for practicing the present invention. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows the amino acid sequence (SEQ ID NO: 1) of the S or spike protein of human 

25 coronavirus recently identified in SARS victims, as well as the heptad repeats predicted by two different 
methods (Multicoil and Learncoil VMF). According to multicoil trimeric prediction (bold), HR1 includes 
amino acid residues from about 900 to about 974 of SEQ ID NO:l, while HR2 includes amino acid 
residues from about 1 148 to about 1 193 of SEQ ID NO: 1. According to learncoil VMF prediction 
(underlined), HR1 includes amino acid residues from about 900 to about 1005 of SEQ ID NO: 1, while 

30 HR2 includes amino acid residues from about 1 1 5 1 to about 1 1 85 of SEQ ID NO: 1 . 

Figure 2 shows the arnino acid sequences of C34 coil-like inhibitors, also referred to as Heptad 
Repeat 2 Peptides. The a, d and e residues from the heptad repeat of S ARS-HR2 are grafted onto the f , b, 
and c positions, respectively, of the GCN4 coiled coil to make (Cys)SARS-HR2-GCN4 ((Cys)HR2G, 
SEQ ID NO:6). Then, through the cysteine residue, this peptide is covalently coupled to (Cys)GCN4 
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(SEQ ID NO:7) to form a stable dimer exposing the SARS-HR1 binding interface on one side. One 
example is shown for the SARS coronavirus sequence. 

Figure 3 illustrates retro-inverso peptides. The parent linear peptide is shown in the middle 
(Boxed). The retro-inverso analog is shown in parallel (bottom) and antiparallel (top) orientation. It is 
5 apparent that in the antiparallel orientation the side-chain topology is maintained, while the direction of 
the peptide bond is reversed. In the retro-inverso peptide, all the amino acids are in the (D) configuration. 

Figures 4A, 4B, and 4C, provide examples of tryptophan analogs. 

DETAILED DESCRIPTION OF THE INVENTION 
10 This invention relates to inhibitors of coronaviruses, such as SARS-CoV. According to an 

embodiment of the present invention, the inhibitors comprise sequences derived from the heptad repeat 
regions (e.g., HR1 orHR2) of coronaviruses. 

1. The Heptad Repeat Regions of Viral Fusion Proteins 

15 Like other enveloped viruses, coronaviruses have a viral membrane containing glycoproteins that 

permit the viruses to bind to target cells and achieve membrane fusion, an essential process for viral 
infection, by which the viral and cellular membranes combine to form a single bilayer. The S 
glycoprotein on the viral membranes of coronaviruses is such a fusion protein, which participates in 
receptor binding and membrane fusion. The S protein comprises two heptad repeat regions, HR1 and 

20 HR2, which are predicted to collapse to form a trimer of hairpin structure in the fusion process. (See, 

DeGroot et al. Journal of Molecular Biology 196:963-966 (1987); Wolf, E. et al. Protein Science 6: 1 179- 
1189 (1997); Singh, M. et al. Journal of Molecular Biology 290:1031-1041 (1999)). 

Heptad repeat regions have been characterized by structural studies in membrane-fusion proteins 
of a variety of different viruses. In the fusion processes, the heptad-repeat sequences all form trimeric 

25 hairpin-like structures, in which a central three-stranded coiled coil is surrounded by supporting helices 
packed in an antiparallel manner. The trimer-of-hairpins structure is thought to bring the viral and 
cellular membrane together. The trimer-of-hairpins structure presumably corresponds to the fusion-active 
state. Extensive studies have shown that viral fusion proteins undergo a series of conformational changes 
to become the trimer-of-hairpins structure from a pre-hairpin intermediate conformation when exposed to 

30 the appropriate triggering signals (Hernandez et al. Annu Rev Cell Dev Biol 12: 627-661 (1996)). 
Various peptides were designed as exogenous inhibitors to bind to the pre-hairpin intermediate in a 
dominant-negative manner and consequently block formation of the fusion-active hairpin structures in 
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viruses such as HIV (Chan and Kim, Cell 93: 681-684 (1998); Eckert and Kim, Annu Rev Biochem 70: 
777-810 (2001)). 

The presence of the two heptad-repeat regions, HR1 and HR2, in S2 subunit of coronavirus S 
proteins indicates that they adopt a trimer-of-hairpins structure in their fusion process. As described 
5 herein, analysis predicts that the HR1 and HR2 regions collapse to form the trimer-of-hairpins structure 
consistent with the structure of HIV gp41. Peptides containing the HR1 and / or HR2 likely act in a 
dominant-negative manner to prevent the formation of the trimer-of-hairpins structure and inhibit 
infection of coronaviruses. Agents that prevent conformational changes which play a critical role in viral 
and cell membrane fusion, such as by stabilizing the native or intermediate states of the viral fusion 
10 proteins are expected to prevent fusion activation and block viral entry. 

Applicants have analyzed the amino acid sequences of a variety of coronaviruses, including that 
of the recently identified human coronavirus isolated from SARS victims. 
( http://vbweb.bcgsc.ca/sars/TOR2 draft genome assembly 120403.fasta.gz) . SARS-associated 
coronavirus is also described by Rota et al., Science 300, 1394-1399 (30 May 2003); published online 1 
15 May 2003, 10. 1 126/science. 1085952; and Marra et al, Science 300, 1399-1404 (30 May 2003); 
published online 1 May 2003, 10.1 126/science. 1085953. 

The analysis has shown that coronavirus S protein comprises two heptad repeat regions which are 
predicted to form a trimer of hairpin structure. As shown in Figure 1, results of analysis of SARS spike 
protein (SEQ ID NO:l) by two methods (Multicoil and Learncoil VMF) identified two heptad repeat 
20 regions that are predicted to form a trimer of hairpin structure as follows. 

Predicted by Multicoil, HR1 comprises amino acid residues from about 900 to about 974 of SEQ 
ID NO: 1; and by Learncoil VMF, HR1 comprises amino acid residues from about 900 to about 1005 of 
SEQ ID NO: 1. Predicted by Multicoil, HR2 comprises amino acid residues from about 1 148 to about 
1193 of SEQ ID NO:l; and by Learncoil VMF, HR2 comprises amino acid residues from about 1151 to 
25 about 1185 of SEQ ID NO:l. The composition (identity and order of amino acid residues) of the spike 
(S) protein of SARS coronavirus presented herein is defined with reference to the TOR2 draft genome 
assembly provided at the website referenced above. 

Based on the understanding of the structures and functions of coronavirus Spike (S) proteins, 
especially their HR1 and HR2 regions, various coronavirus inhibitors are designed, produced, and tested. 
30 Examples of the inhibitors are illustrated as follows. 
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2. HR2 Peptides 

A subject of this invention are peptides, referred to as HR2 (heptad repeat 2) peptides, which 
comprise at least 20 arnino acid residues from coronavirus heptad repeat 2 and are useful as inhibitors of 
coronavirus infection of mammalian cells, such as human epithelial cells. Such peptides axe also referred 
5 to as C-peptides, because the S2 subunit is at the C terminus of coronavirus S protein and the HR2 region 
is C-terminal of the HR1 region (Fig. 1). HR2 peptides are thought to bind to HR1 region in the pre- 
hairpin intermediate of S protein in a dominant-negative manner, block formation of the fusion-active 
hairpin structures, and consequently inhibit infection of coronaviruses such as SARS-CoV. 

HR2 peptides comprise from about 20 to about 60 amino acid residues from the predicted HR2 

10 region. In some embodiments, HR2 peptides comprise from about 20 to about 50 amino acid residues 
from the predicted HR2 region. Amino acid residues present in these HR2 peptides may be contiguous 
amino acid residues that occur sequentially in a coronavirus HR2 or may be noncontiguous arnino acid 
residues that are separated by intervening amino acid residues in the coronavirus HR2. In the 
embodiments of all peptides described herein in which noncontiguous arnino acid residues comprise the 

15 peptides, amino acid residues other than those that occur between the noncontiguous residues in the 

coronavirus (e.g., other than those that occur between the noncontiguous arnino acid residues in SARS in 
nature) are introduced in order to maintain, for example, appropriate distance between the residues and 
ensure that the resulting peptide has an appropriate conformation. An example of the noncontiguous 
amino acid residues is shown in Fig. 2 and section 3 as follows. 

20 Specific embodiments of SARS HR2 peptides include any combination of from about 20 arnino 

acid residues to about 60 amino acid residues of SARS HR2 peptide, homologues and variants of such 
sequences, including variants that result from correction of any errors in the TOR2 draft genome 
assembly published at http://vbweb.bcgsc.ca/sars/TQR2 draft genome assembly 120403.fasta,gz and 
additional SARS HR2 regions encoded by the genome of additional coronaviruses identified or obtained 

25 from SARS victims that differ from the referenced publicly available sequence. 

In specific embodiments, SARS HR2 peptides comprise from about 20 to about 60 ainino acid 
residues, which can be contiguous or noncontiguous in the region as it occurs in SARS virus, of the 
following sequence, which is residues 1144-1200 or 1142-1193 of the predicted spike protein of SARS 
Tor2(SEQIDNO:l). 

30 Amino acid residues 1 144-1200 SEQ ID NO: 1 has the sequence as follows: 

PDVPLGDI^INASVVNI QKEIDRLNEVAI^LNES L IDLQELGKYEQ YI KWPWYVWL (SEQ ID NO:2), in 
which solvent exposed regions of the HR2 include the b, c and f positions of the heptad repeat. The 
solvent exposed positions are underlined. 
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Additional peptides can be identified using known methods, such as by determining the location 
and composition (amino acid sequence) or heptad repeat 2 and selecting peptides of from about 20 to 
about 60 amino acid residues. Examples of such HR2 peptides are partial or full sequence of SEQ ID 
NO:2. HR2 peptides can consist of amino acid residues from N to (N+M-l) of SEQ ID NO:2, wherein M 
5 is the length of the HR2 peptide, and N can be any integral number from 1 to (58-M). 

In specific embodiments, HR2 peptides consist of amino acid residues from 5 to 50, from 9 to 41, 
from 21 to 57, from 5 to 41, from 30 to 57, or from 1 to 57 of SEQ ID NO:2. The region of residues from 
30 to 57 of SEQ 3D NO:2 appears to be very highly conserved among coronaviruses and, thus is a peptide 
of particular interest for designing or identifying C-peptide/HR2 peptide inhibitors. 
10 According to a preferred embodiment of the present invention, the HR2 peptide has a sequence of 

residues 1148-1182 or 1148-1185 of SEQ ID NO:l. Residues 1148-1182 of SEQ ID NO:l has the 
sequence as follows: LGDISGINASVVNIQKEIDRLNEVAKNLNESLIDL (SEQ ID NO:3). 

3. HR2 Coil-Like Inhibitors 

15 A further subject of this invention are coronavirus inhibitors, referred to as HR2 coil-like 

inhibitors, which are stable dimers that expose an HR1 binding site in such a manner that it is available 
for binding. HR2 coil-like inhibitors are thought to inhibit infection of coronaviruses such as SARS-CoV 
with the same mechanism as that of HR2 peptides. 

In this embodiment, amino acid residues a, d, and e of coronavirus HR2 (e.g., SARS-CoV-HR2) 

20 are grafted onto the f, b and c positions, respectively, of a coiled coil, such as the GCN4 coiled coil 

GCN4-pl, to produce (Cys)-HR2-coiled coil (e.g., (Cys)-SARS-CoV-HR2-GCN4). This peptide is then 
covalently coupled to (Cys) coiled coil (e.g., (Cys)GCN4) to form a stable dimer exposing the HR1 
binding interface of the coronavirus (here, the SARS-CoV-HRl binding interface) on one side. Examples 
of such HR2 coil-like inhibitors are shown in Figure 2. GCN4-pl is a dimeric coiled coil whose sequence 

25 was derived from GCN4, a yeast transcription activator (O'Shea, E. K., et al, Science 254:539-544 
(1991)). 

Additional coil-like inhibitors can be identified using similar techniques, which rely on 
identification of the heptad register, followed by application to the repeat sequence of one or more coil 
prediction programs (e.g., Multicoil, Learncoil VMF) and grafting of amino acid residues onto an 
30 appropriate coiled coil. See, for example, Sia, S. Doctoral Thesis entitled Rational Design of peptide 

inhibitors of HIV-1 entry. December 2001, submitted to the Committee on Higher Degrees in Biophysics 
in partial fulfillment of the requirements of Doctor of Philosophy in the subject of Biophysics, Harvard 
University (see also Proc Natl Acad Sci USA) 100:9756-61 (2003)). 
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4. HR1 Peptides 

HR1 (Heptad Repeat 1) peptides or inhibitors, also referred to as N-peptide inhibitors, are a 
further subject of this invention, HR1 peptides are thought to bind to HR2 region in the pre-hairpin 
intermediate of S protein in a dominant-negative manner, block formation of the fusion-active hairpin 
5 structures, and consequently inhibit infection of coronaviruses such as SARS-CoV. 

Such HR1 peptides generally include from about 28 to about 60 amino acid residues from 
coronavirus HR1 peptide, although in some cases, they may include fewer than 28 or more than 60 HR1 
amino acid residues. The amino acid residues in the HRl-peptide inhibitors can be residues that occur 
sequentially (are contiguous) or nonsequentially (are noncontiguous) in the coronavirus HR1 as it occurs 
10 in nature. The terms Contiguous and noncontiguous and sequential and nonsequential are used with 

reference to the amino acid sequence of a peptide or protein (e.g., HR1, HR2) as it occurs in the virus in 
nature. 

A wide variety of HR1 peptides can be identified and produced, using known methods and the 
present invention encompasses all HR1 peptides from about 28 to about 60 or about 80 amino acid 

15 residues from the HR1 repeats described herein. In particular embodiments, HR1 peptides of the present 
invention include all peptides of from about 28 to about 60 or about 80 amino acid residues of SARS 
coronavirus HR1, defined with reference to the deduced amino acid sequence encoded by the referenced 
publicly available SARS genome data (SEQ ID NO:l and Fig. 1). In addition, the amino acid sequences 
of HR1 peptides can be altered, such as by addition, substitution or deletion of one or more amino acid 

20 residues, in order to alter certain characteristics (e.g., to increase peptide stability, enhance peptide 
solubility). 

HR1 peptides can have a length of from about 28 to about 60 or about 80 amino acid residues, 
and be derived from residues 900-1005 or 889-972 of SEQ ID NO: 1. Examples of HR1 peptides include, 
but are not limited to: A peptide having a sequence of residues 896-972, 900-938, 914-949, 922-956, 943- 
25 980 or 943-981 of SEQ ID NO:l. 

Another embodiment of this invention is a complex of an HRl-peptide inhibitor tethered (with an 
appropriate linker, which can be made of amino acid residues, or can be a chemical or synthetic linker) to 
a component, such as a ligand (protein or small molecule) that binds coronavirus envelope protein. The 
complex is useful as a therapeutic agent, because the ligand will increase the effective concentration of 
30 the tethered HRl-peptide inhibitor moiety in the vicinity of the coronavirus envelope protein. 

HR1 peptides are capable of forming trimeric coiled coil. The HR1 trimeric coiled coil could be 
stabilized by including cysteine residues to form interchain disulfide bonds in a chemically synthesized or 
recombinant peptide greater than or equal to 28 and less than or equal to 60 or 80 sequential residues from 
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the predicted HR1 region. The recombinant versions could contain helix capping residues on the N- and 
C-terminus. Possible capping residues for the N-terminus are T, D and S. Possible capping residues for 
the C-terminus are G, R, H, N, or K. Possible amino4errninal caps for the synthetic peptides are acetyl 
(Ac) or succinimide (sue). For SARS virus, the HR1 comprises the sequence of residues 900-1005. 
5 Possible inhibitors of SARS-CoV include, but are not limited to: 

CCG TTTSTALGKLQDWNQNAQ ALNTLVKQLS SNFGAI S (SEQIDNO:8), 
CCG ENOKQIANQFNKAISQIQESLTTTSTALG (SEQ ID NO:9), and 
CCG ALNTLVKQLSSNFGAISSVLNDILSRLDKVEAEVQIDR (SEQ ID NO: 10). 

10 5. HR1 Chimeric Peptides 

The present invention also relates to HR1 chimeric peptides comprised of an HR1 peptide fused 
to a peptide that is capable of forming a solubilizing trimeric coiled coil. The HR1 chimeric peptides 
form stable and soluble HR1 chimeric trimeric coiled coil. HR1 chimeric peptides are thought to inhibit 
infection of coronaviruses such as SARS-CoV with the same mechanism as that of HR1 peptides. 
15 Examples of the HR1 chimeric peptides include HR1-IQ peptides and HR1-IZ peptides. 

5.1. The HR1 peptide portion of the HR1 chimeric peptides 

According to an embodiment of the present invention, a HR1 chimeric peptide comprises at least 
a portion of HR1 of coronavirus S2 protein, as disclosed in sections 1 and 4. The HR1 region component 
20 of these peptides can be of varying length and will typically be from about 15 to about 50 amino acid 
residues from the coronavirus HR1 region. The HR1 peptide, in some cases, may be shorter than 15 
amino acid residues or longer than 50 amino acid residues. 

5.2. The peptides capable of forming solubilizing trimeric coiled coils 

25 The solubilizing trimeric coiled-coil peptide can be from (comprise amino acid residues that 

correspond to those in) a variety of sources, including IQ peptides and IZ peptides. 

IQ peptide can be from (comprise amino acid residues that correspond to those in) a variety of 
sources, including yeast transcription activator GCN4, such as GCN4-pH or GCN4«pIqI; Moloney Murine 
Leukemia Virus (MoMLV); and the ABC heterotrimer. 

30 The amino acid residues that comprise an IQ peptide of the present invention can be amino acid 1 

residues that are sequential (consecutive) or noncontiguous (nonconsecutive) in the trimeric coiled-coil 
peptide from which it is derived and/or amino acid residues that are sequential (consecutive) or 
noncontiguous (nonconsecutive) in HR1 peptide, provided that the resulting IQ peptide (the IQ peptide in 
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which they are present) is stable, soluble, helical, and trimeric under physiological conditions. In the 
embodiments of IQ peptides in which nonconsecutive amino acid residues of either or both components 
of the IQ peptide are present, the residues, as included in the IQ peptide, can be consecutive or can be 
separated or joined by a linker. The linker can be, for example, an amino acid residue(s) that does not 
5 occur between two amino acid residues in the peptide from which the component is derived. 
Alternatively, the "linker" can be a chemical or synthetic linker. 

A component of an IQ peptide or HR1-IQ peptide of the present invention is considered to be 
derived from another peptide (e.g., a trimeric coiled coil or HR1 of a coronavirus) if the component itself 
(or the nucleic acid moiecule(s) that encode the amino acid sequence) is obtained or isolated/separated 

10 from a source in which it occurs (e.g., from a cell in which the peptide occurs, such as a portion of a 

protein from which it can be removed) or is produced by recombinant DNA methods, chemical synthesis 
or any other method, to comprise an amino acid sequence or a nucleic acid sequence that is the same as or 
substantially the same as the sequences of the peptide. That is, the term is intended to be interpreted 
broadly and does not require that a component be physically derived from the peptide referred to. 

15 The trimeric coiled coil peptide component of an IQ peptide must be sufficient in amino acid 

composition (identity and number/length) to result, when joined to the HR1 peptide portion, in formation 
of a soluble trimeric helical (coiled-coil) IQ peptide. 

In the embodiments in which the soluble IQ peptides comprise an IQ region that is a GCN4 
trimeric coiled-coil peptide, they are referred to as IQN peptides. As discussed in section 3, GCN4-pl is 

20 a dimeric coiled coil whose sequence was derived from GCN4, a yeast transcription activator (O'Shea, E. 
K., et al, Science 254:539-544 (1991)). Based on GCN4-pl, trimeric coiled-coil peptides have been 
developed, such as GCN4-pH (Harbury, P. B„ et al, Nature 371:80-3 (1994)) and GCN4-pI Q I (Eckert 
D.M., et al, J. Mol Biol. 284:859-65 (1998)). The trimeric coiled-coil peptides can be used to fuse with 
the HR1 sequence. 

25 The chimeric HR1-IQN peptides comprise all or a portion of GCN4-pI Q I or a modified version 

of all or a portion of GCN4-pI Q I, such as a modified portion that includes mutations for increased 
solubility, and an HR1 peptide of from about 15 to about 50 arnino acid residues. 

In certain embodiments of the IQN peptides of the present invention, the trimeric coiled-coil 
peptide, referred to as the "GCN4 portion", comprises at least 15, 16, 17, 18, 19 or 20 arnino acid residues 

30 of GCN4 (S. K. Sia and P. S. Kim, Proc. Natl. Acad. Set U.S.A. 100:9756-9761 (2003)). The amino acid 
residues present in the components of an IQN peptide can correspond to amino acid residues that are 
sequential (consecutive) or nonsequential (nonconsecutive) in, respectively, the GCN4 transcription 
activator (or GCN4-pIqI) and HR1 peptide or a modified version of the activator or the HR1 peptide, 
provided that the resulting IQN peptide is an inhibitor of coronavirus infection of cells, as described 
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herein. The IQ and IZ peptides of the present invention can be produced as a continuous peptide or as 
components that are joined or linked after they are formed. As used herein, the terms "joined" or "joined 
in such a manner" or "incorporated" include incorporating amino acid residues by either approach. 
For example, the GCN component of an IQN peptide can comprise consecutive amino acid 
5 residues from GCN4-pI Q I, modified, if desired (e.g., to increase solubility). Alternatively, amino acid 
residues that are not consecutive in the GCN4 activator (or in GCN4-pIqI), joined in such a manner that 
they are nonconsecutive or consecutive in the resulting GCN4 component of an IQN peptide, can be 
incorporated in the IQN peptide. Similarly, the amino acid residues of the HR1 peptide component of an 
IQN peptide of the present invention can be amino acid residues that occur consecutively or 

10 nonconsecutively in HR1 peptide of SARS S protein and can be incorporated into in IQN peptide in such 
a manner that they are consecutive or nonconsecutive in the resulting peptide. In the embodiments in 
which nonconsecutive amino acid residues are used, they can be separated by one or more "linker" 
molecules, if needed to retain the respective functions/characteristics of the components and of the IQN 
peptide. For example, an amino acid residue(s) other than the residue(s) that normally occur between two 

15 amino acid residues of GCN4 or HR1 peptide can be used to link or join the two amino acid residues in 
the IQN peptide. Alternatively, the linker can be a chemical or synthetic linker, for example. IQN36, as 
well as versions of IQN17 that are shortened in the 'IQ' region, are also described. These shortened 
versions may be therapeutically advantageous because, for example, they are easier and less expensive to 
produce than are larger peptides. 

20 The examples of IQ peptides include: 

GCN4-pIqI: RMKQIEDKIEEILSKQYHIENEIARIKKLIGER (SEQ ID NO: 1 1) 
GCN4-pI Q F: RMKQIEDKIEEIESKQKKIENEIARIKKLIGERY (SEQ ID NO: 12) 

A specific embodiment of an IQN peptide is IQN17, which contains 29 residues of GCN4-pI Q I, 
including three mutations for increased solubility. The sequence of IQN17 consists of residues 1-29 of 

25 SEQ ID NO: 12. 

Shortened versions of IQN17, which each include a shorter GCN component than is present in 
IQN17, are also the subject of this invention. Specific examples of these shortened IQN17 peptides are: 

a) shortened IQN17 #1, consisting of residues 22-29 of SEQ ID NO: 12; 

b) shortened IQN17 #2, consisting of residues 15-29 of SEQ ID NO: 12; 
30 c) shortened IQN17 #4, consisting of residues 8-29 of SEQ ID NO: 12; 

d) shortened IQN17 #5, consisting of residues 8-28 of SEQ ID NO: 12; and 

e) shortened IQN17 #3, in which there are 15 non-HRl amino acid residues KIKKIENEIARIKKL 
(SEQ ID NO: 13). This is GCN4-plQr with a Q to I mutation, and is referred to as GCN4-pII\ 
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The solubilizing trimeric coiled-coil peptide can also be from other designed trimeric coiled coils, 
such as the isoleucine zipper (IZ) described by Tanaka et al. (Suzuki, K., Hiroaki, H., Kohda, D. & 
Tanaka, T. Protein Eng. 11:1051-1055 (1998)), or derivatives of this 'IZ* sequence. In those 
embodiments in which the trimeric coiled-coil peptide is from the isoleucine zipper, they are referred to 
5 as IZ peptides. 

Tanaka's isoleucine zipper: Ac-YGG IEKKI EAIEKKI E AI EKKI E AIEKKI E A-NH2 (SEQ ID NO: 14) 
The "IZ" molecule derivative: Ac-YGGIKKEIEAIKKEQEAIKKKIEAIEKEIEA-NH 2 (SEQ ID NO: 15). 

Moreover, the solubilizing trimeric coiled-coil peptide can comprise a trimeric coiled-coil peptide 
from HIV. 

10 

5.3. HR1 chimeric peptide configurations 

In one embodiment, the components of the HRl-IQ peptides are present in the following "order": 
N-ternimus--trimeric coiled-coil peptide-HRl peptide- C-terminus. Alternatively, the order of 
components can be reversed: N-tenninus-HRl peptide-trimeric coiled-coil peptide-C-terrninus. 
15 HR1-IQN17 and other HR1-IQN peptides can be tested for their ability to inhibit coronavirus 

infection of cells using known methods. 

Soluble HR1 chimeric peptides of the present invention comprise, in specific embodiments, the 
amino acid sequences shown as follows: 

TIKKEIEAIKKEOEAIKKKIEAIEKEI TTTSTALGKLQDWNQNAQALNTLVKQLSSNFGAIS G (SEQ ID 

20 NO: 16), or a peptide consisting of residues 2-63 of SEQ ID NO: 16; 

TRKKOTEDKIEEIESKOKKIENEIARIKKLI TTTSTALGKLQDVVNQNAQALNTLVKQLSSNFGAIS G 

(SEQ ID NO: 17), or a peptide consisting of residues 2-67 of SEQ ID NO: 17; 

RMKOIEDKIEEIESKOKKIENEIARIKKLI SQIQESLTTTSTALGKLQPVVNQNAQALNTLVKQLS (SEQ 
ID NO: 18); 

25 RMKQIEDKIEEIESKOKKIENEIARIKKLI ENQKQIANQFNKAISQIQESLTTTSTALG (SEQ ID 
NO:19); 

TRMKQTEDKIEEIESKOKKIENEIARIKKL ENQKQIANQF3STKAISQIQESLTTTSTALG (SEQ ID 
NO:20); 

TIKKEIEAIKKEOEAIKKKIEAIEKEI ENQKQIANQFNKAISQIQESLTTTSTALG (SEQ ID NO:21), or 

30 a peptide consisting of residues 2-56 of SEQ ED NO: 2 1 ; 

TIKKEIEAIKKEOEAIKKKIEAIE KRLQSLQTYVTQQLIRAAEIRASAN (SEQ ID NO:22), or a peptide 
consisting of residues 2-49 of SEQ ID NO:22; 
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TRMKOIEDKIEEIESKQKKIENEIARIK KRLQSLQTYVTQQLIRAAEIRASAN (SEQ ED NO:23), or a 
peptide consisting of residues 2-53 of SEQ ID NO:23; 

TRMKOIEDKIEEIESKQKKIENEIARIK KALNTLVKQIjSSNFGAISSVLNDILSRLDKVEAEVQIDR 
(SEQ ID NO:24), or a peptide consisting of residues 2-67 of SEQ ID NO:24; 
5 TIKKEIEAIKKEQEAIKKKIEAIE KALNTLVKQ (SEQ ID 

NO:25), or a peptide consisting of residues 2-63 of SEQ ID NO:25. 

IKKEIEAIKKEQEAIKKKIEAIEKEI SQIQESLTTTSTALGKLQDWNQNAQALNTLVKQLS (SEQ ID 
NO:26); and 

IKKEIEAIKKEOEAIKKKIEAI 3WLYENQKQIANQFNKAISQIQESLTTTSTALGKLQ (SEQ ID NO: 27). 

10 

Examples of IZ peptides, referred to as IZN36, IZN23 and IZN17-like inhibitors of SARS 
coronavirus are shown as follows. 

IKKEIEAIKKEQEAIKKKIEAIEKEI ENQKQIANQFNKAISQIQESLTTTSTALGKLQDWN (SEQ ID 
NO:28), or a peptide consisting of residues 1-49 of SEQ ID NO:28; 
15 IKKEIEAIKKEQEAIKKKIEAIEKEI NQNAQALNTLVKQLSSNFGAISSVXjNDILSRLDKVE (SEQ ID 
NO:29), or a peptide consisting of residues 1-49 of SEQ ED NO:29; 
IKKEIEAIKKEQEAIKKKIEAIE KSLTTTSTALGKLQDWN (SEQ ED NO:30); 
IKKEIEAIKKEQEAIKKKIEAIEK AISSVLNDILSRLDKVE (SEQ ID NO:31); 

SQIQESLTTTSTALGKLQDVWQNAQALNTLVKQLSS IKKEIEAIKKEQEAIKKKIEAIEKEIG (SEQ ID 
20 NO:32) 

Such inhibitor peptides can be administered to an individual, such as in a pharmaceutical 
composition comprising one or more (at least one) of the IZ peptides and an appropriate carrier (e.g., an 
appropriate buffer or carrier) to prevent or inhibit SARS infection in the individual. 

25 

6. Five-Helix Proteins 

Another embodiment of the present invention encompasses Five-Helix or Five-Helix proteins. 
The Five-Helix proteins comprise three heptad repeat 1 (HR1) components and at least two and up to, but 
not including, three heptad repeat 2 (HR2) components of coronavirus S2 protein, and are soluble under 
30 physiological conditions. The Five-Helix proteins interact with or bind to HR2 of coronavirus S2 protein; 
and inhibit coronavirus infection of mammalian cells, particularly human cells, such as epithelial cells of 
the upper respiratory tract or gastrointestinal tract. Thus, Five-Helix includes three HR1 components and 
at least two HR2 components of coronavirus S2 protein. It can also include a portion of a third 
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coronavirus S2 HR2, but does not comprise three entire S2 protein HR2 components. In each case, the 
HR1 and HR2 components are separated by linkers, preferably amino acid residue linkers, which can be 
of any length. The amino acid linkers can be, for example, GGGSSGGGSGG (SEQ ID NO:33). 

Also the subject of the present invention is Six-Helix protein, which comprises three HR1 
components and three HR2 components of coronavirus S protein, joined by linkers, such as amino acid 
residue linkers. In other embodiments, the amino acid sequence of Six-Helix differs from those shown in 
the figure by addition, deletion, substitution or alteration of at least one amino acid residue. Six-Helix 
protein is useful not only for producing Five-Helix, but also as a negative control in screening for drugs 
that inhibit coronavirus infection of cells. 

In one embodiment, a Five-Helix protein of this invention comprises three heptad repeat 1 (HR 1) 
components and at least two and up to, but not including, three heptad repeat 2 (HR2) components of an 
S2 protein of a coronavirus that causes SARS; interacts with or binds to HR2 of the SARS-CoV S2 
protein; and inhibits infection of human cells, such as infection of cells of the epithelium of the 
respiratory tract. 

In one embodiment, Five-Helix can be represented as: HRl-Unker-HR2-linker-HRl-linker-HR2- 
linker-HRl, wherein HR1 represents a coronavirus HR1 and HR2 represents a coronavirus HR2. In a 
further embodiment, Five-Helix can be represented as: HRl-liiiker-HR2-lirucer-mi-lmker-HR2-linker- 
HRl-linker-partial HR2 wherein HR1 represents a coronavirus HR1 and HR2 represents a coronavirus 
HR2 and "partial HR2" indicates that this component is a portion of a coronavirus HR2. The term Five- 
Helix or Five-Helix protein encompasses all such embodiments (those including three HR1 components 
and two or more, but less than three, HR2 components, separated by appropriate linkers). In specific 
embodiments, the HR1 and HR2 components are human coronavirus heptad repeats and partial HR2 is a 
partial (less than a complete) human coronavirus heptad repeat. 

In further specific embodiments, the Five Helix protein can be represented as: HR1 of SARS 
virus-linker-HR2 of SARS virus-Iinker-HRl of SARS virus-linker-HR2 of SARS virus-linker-HRl of 
SARS. In further specific embodiments, the Five Helix protein can be represented as: HR1 of SARS 
virus-linker-HR2 of SARS virus-linker-HRl of SARS virus-linker-HR2 of SARS virus-linker-HRl of 
SARS-linker-partial HR2 of SARS. 

In specific embodiments, the Five-Helix proteins comprise HR1 components having the sequence 
of from about residue 900 to about residue 1013 of SEQ ID NO: 1, HR2 components having the sequence 
of residues from about 1148 to about 1 193 of SEQ ID NO: 1, and amino acid linkers therebetween. The 
HR1 and/or HR2 SARS sequence can be altered by the addition, deletion or substitution of one or more 
amino acid residues to produce additional Five-Helix proteins. 
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In a specific embodiment, a Five-Helix protein comprises three HR1 repeats having the following 
amino acid sequence: the residues 900-1005 of SEQ ID NO:l, and at least two and up to, but not 
including, three HR2 repeats having the following amino acid sequence: the residues 1 151-1 185 of SEQ 
ID NO:l. In another embodiment, a Five-Helix protein comprises three HR1 repeats having the 
5 following amino acid sequence: the residues 900-974 of SEQ ID NO: 1, and at least two and up to, but not 
including, three HR2 repeats having the following sequence: the residues 1 148-1 193 of SEQ ID NO: 1 . In 
another specific embodiment, a Five-Helix protein comprises three HR1 repeats having the amino acid 
sequence: the residues 900-1005 of SEQ ID NO:l, and at least two and up to, but not including, three 
HR2 repeats having the following sequence: the residues 1 148-1 193 of SEQ ID NO: 1. In a further 

10 specific embodiment, a five-helix protein comprises three HR1 repeats having the amino acid sequence: 
the residues 900-974 of SEQ ED NO: 1, and at least two and up to, but not including, three HR2 repeats 
having the following amino acid sequence: the residues 1151-1185 of SEQ ID NO: L 

According to an embodiment of the present invention, a Five-Helix protein has the sequence of 
residues 1-243 of SEQ ID NO:34, whereas the corresponding Six-Helix the sequence of SEQ ID NO:34: 

15 MVLYENQKQIANQFNK^ 

QNA GGGSS(^GSGGDISGINASVVNIQKEIDR1LiN^ 

jUSTQ FNKAISQIQESLTCTSTALGKLQD 
NESLIDLQEL (SEQ ID NO: 34). 
20 According to another embodiment of the present invention, a Five-Helix protein has the sequence 

of residues 1-237 of SEQ ID NO:35, whereas the corresponding Six-Helix the sequence of SEQ ID 
NO:35: 

MTALGKXjQDWNQNAQ ALNTLVKQL S SNFGAI S S VLNDILS GGGS SGGGSGGDISGINASWN1QKEIDR 

LNEVAKNIiNESLIDLQELGGGSSGGGSGG TALGKLQDVWQNAQALN 
25 GGS SGGGSGGDISGINASVVNIQKEIDRIiNEVAKNIiNESLIDLQEIiGGGSSGGGSGG TALGKLQDVVNQN 

AQALNTLVKQLS SNFGAI SSVLITO I 

LQEL (SEQIDNO:35). 

According to an alternative embodiment of the present invention, a Five-Helix protein has the 

sequence of residues 1-237 of SEQ ID NO:36, whereas the corresponding Six-Helix the sequence of SEQ 
30 ID NO:36: 

MQIDRLITGRLQSLQTYVTQQLIRAAEIRASAI^AATKMSE GGGSSGGGSGGDISGINASVVNIQKEIDR 

LNEV&KNIiNESLIDLQELGGGSSGGGSGG QIDR^ 

GGSSGGGSGGDISGINASVVNIQKEXDRIiNEVAKNIi^ 
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QTYVTOOLlRAAEIRASANLAATKMSEG 
LQKL (SEQ ID NO:36). 

According to an alternative embodiment of the present invention, a Five-Helix protein has the 
sequence of residues 1-317 of SEQ ID NO:37, whereas the corresponding Six-Helix the sequence of SEQ 
5 IDNO:37: 

riryr J2t\t fimiS'A fiWWT^K R T TYB T .WFTV ATCNTjNE S T-jX DLOELGKYEO YI KGGG S S GGG 5 G GGX GVTQNVL YE 

NQKQlANQFNKAISQIQESLTTTSra^ 

VVKITQTrRTnRrJSrg^AKNLNESL 

10 ISQIQESLTTTSTALGKLQDVVNQNAQALNTL^ 

EVAKNIiNESLIDLQELGKYEQYIK (SEQ ID NO: 37). 

The amino acid sequence of each of the components (the HR1 repeats and the HR2 repeats) can 
vary widely, provided that the resulting repeats form a trimer of hairpins. The sequences provided herein 
can be modified by the addition, deletion, or substitution of one or more amino acid(s). Amino acid 

15 residues in the C-peptide helix (HR2), particularly the amino acid residues that are solvent exposed (have 
high solvent accessibility) in the trimer of hairpin structure, can be changed to other amino acid residues 
with considerable flexibility. For example, such amino acid residues can generally be changed to 
(substituted with) naturally-occurring amino acid residue except, in general, proline, which can be used as 
a cap. Amino acid residues in the N-peptide helix (HR1) can also be changed. In the N-peptide, the 

20 amino acid residues that are buried in the coiled-coil trimer (buried in the trimer interface) can be 

substituted with amino acid residues that can be accommodated in the interface. Such characteristics of 
amino acid as size, charge and hydrophobicity must be taken into consideration in selecting amino acid 
substitutions in the N-peptide. 

As discussed above, the amino acid residue composition of Five-Helix of the present invention 

25 can vary greatly. In those instances in which Five-Helix is used in therapeutic applications, it must 

present a surface or region that is available for binding or interacting with coronavirus HR2. That is, the 
remaining surface of Five-Helix must be present in such a manner or conformation that it is available to 
bind or interact with coronavirus HR2. In the cases in winch Five-Helix is used as a drug-screening tool 
or an antibody-screening tool or as a vaccine or immunogen, Five-Helix need not bind or interact with 

30 coronavirus HR2. 

The HR1 components of Five-Helix proteins of the present invention can vary in amino acid 
composition (type and number). They will generally comprise from about 20 to about 100 amino acid 
residues of HR1 of coronavirus (e.g., human) S2 protein and each component can comprise any number 
of amino acid residues from about 20 to about 100. They will generally comprise from about 20 to about 
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60 amino acid residues of HR2 of coronavirus (e.g., human) S2 protein and each can comprise any 
number of amino acid residues from about 20 to about 60. The amino acid composition of each HR1 
component in a Five-Helix protein can be the same or different. The amino acid composition of each 
HR2 component in a Five-Helix protein can be the same or different. The amino acid residues included 
5 in HR1 and HR2 can occur sequentially/consecutively in the corresponding region of a coronavirus or 
may be nonconsecutive amino acids (e.g., amino acids that are not consecutive in the coronavirus HR1 
and/or HR2 can make up an HR1 or HR2 of a Five-Helix). 

In further embodiments, Five-Helix protein can comprise equivalent HR1 and HR2 components 
from other coronaviruses, which can be identified using known methods, such as Multicoil and Learncoil 

10 VMF. See, for example, Wolf, E. et al. Protein Science (June 1997) 6: 1 179-1 189, describing Multicoil 
and Singh, M. et al. Journal of Molecular Biology (1999) 290: 103 1-1041, describing Learncoil VMF. 
Alternatively, Five-Helix proteins of the present invention can differ from a sequence presented herein by 
addition, deletion, substitution or alteration of at least one amino acid residue. The linkers can be of any 
length or composition, provided that the Five-Helix protein conformation, described herein, is retained. 

15 Five-Helix can be an L-amino acid protein, a D-amino acid protein or a combination of L-amino acid 
residues and D-amino acid residues; these residues can be modified residues or non naturally occurring 
residues. 

7. The Retro-inverso Analogues 

20 The present invention further relates to the inhibitors that are retro-inverso analogues of the 

inhibitors derived from HR1 and/or HR2 of the coronaviruses such as SARS-CoV. The in vivo stability 
of a peptide is critical for its application in therapies. The retro-inverso analogues have higher stability 
than their counterparts, while reserving the capability of binding the pre-hairpin intermediate of S protein 
and inhibiting coronavirus inhibition. 

25 

7.1. The Retro-inverso Peptides 

The retro-inverso analogue of a peptide is more stable than the peptide, while it often maintains 
the biological activities of the peptide. A retro-in verso peptide is an isomer of a linear peptide in which 
the direction of the amino acid sequence is reversed, and the chirality of each amino acid residue is 
30 inverted (Goodman & Chorev, Acc. Chem. Res. 12: 1-7 (1979); Chorev & Goodman, M. Acc. Chem. Res. 
26:266-273 (1993); Chorev & Goodman, M. Trends Biotechnol 13:438-445 (1995); Fletcher & 
Campbell, Chem. Rev. 98:763-795(1998); Taylor, et al, J. Pharmacol. Exp. Tlier. 295:190-194 (2000)). 
This results in inversion of each peptide bond within the peptide sequence, while the overall topology of 
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the side-chains is maintained, except for the tenninal groups. Hence, when a retro-inverso peptide is 
superimposed onto the parent peptide in an antiparallel fashion, the overall topology of the side-chains is 
maintained (See Fig. 3). 

Because most proteins are composed of L-arnino acids, retro-inverso peptides are usually 
5 peptides composed of D-amino acids. As used herein, D-chirality is indicated with the lowercase, while 
L-chirality is indicated with the uppercase. For arnrno acids comprising more than one chiral center, like 
isoleucine and threonine, the amino acid used in the retro-inverso analog may comprise any combination 
of the possible configurations of the said amino acid. As used herein, "i" may represent (D)-Isoleucine, 
(D)-allo-isoleucine, or (L)-allo-isoleucine, while "t" may represent (D) threonine, (D)-aUo-threonine, or 

10 (L)-allo-threonine. 

When the interaction of a peptide ligand with another peptide or protein is mainly dependent on 
side-chain topology, and much less so on contacts established with the peptide amide backbone, the 
binding affinity of the parent sequence can be preserved in the retro-inverso analog. There are several 
examples of retro-inverso peptides, which maintain the same biological activity of the parent peptides (for 

15 recent examples see Levi et aL, Res. Human Retroviruses 16:59-65 (2000); Tayor et al., 2000; Pescarolo 
et al., FASEB J. 15:31-33 (2000); Chen et al., X Med, Chem. 45 1624-1632 (2002); D'Ursi et aL, J. Med. 
Chem. 46 1807-1810 (2003)). In contrast, retro-inverso analogs were biologically inactive in a few cases. 
A possible cause might be the non-complementarity of the end group with the corresponding end groups 
of the linear peptide, as for (Fletcher & Campbell, Chem. Rev. 98:763-795(1998)). 

20 This non-complementarity, however, is unlikely to render inactive the retro-inverso analogues of 

the heptad repeats of viral fusion proteins. These peptides are thought to act in a dominant-negative 
manner, by substituting for one or more components of the trimer, which results from collapse of the HR 
1 and 2, thereby preventing the conformational changes necessary for fusion (Lambert et aL, Proc. Natl 
Acad. Set U.S.A. 93:2186-2191 (1996); Chan & Kim, Cell 93:681-684 (1998); Sodroski, Cell 99:243-246 

25 (1999)). This interaction occurs over a large surface area, and does not depend on a single, specific 

contact between the interacting peptide pairs. For instance, N-terminal truncations of the potent HIV-1 
fusion inhibitor DP-178 give rise to fully active peptides (Lawless et al., Biochemistry 35: 13697-13708 
(1996)). 

Furthermore, retro-inverso peptides invariably show extended stability in serum, because the 
30 inverted amide backbone makes them poor substrates for serum proteases (Goodman & Chorev, Acc. 
Chem Res. 12:1-7 (1979); Chorev & Goodman, M. Acc. Chem. Res. 26:266-273 (1993); Chorev & 
Goodman, M. Trends Biotechnol. 13:438-445 (1995); Levi et aL, Res. Human Retroviruses 16:59-65 
(2000); Tayor et aL, 2000; Pescarolo et aL, FASEB J. 15:31-33 (2000); Chen et aL, J. Med. Chem. 45 
1624-1632 (2002); D'Ursi et aL, J. Med. Chem. 46 1807-1810 (2003)). The increased stability in serum 
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is a key advantage of retro-inverso peptides, since it may result in increased serum half-life and improved 
PK properties. Indeed, D'Ursi et al J. Med. Chem. 46 1807-1810 (2003)) show that the retro-inverso 
analog of an octapeptide inhibitor of Feline Immunodeficiency Virus is only 4-fold less active than the 
parent peptide in vitro, but it is 6-fold more potent in the presence of 50% Normal Cat Serum. 

5 

7.2. The Retro-inverso Analogues of Coronavirus HR Peptides 

According to an embodiment of the present invention, the inhibitor comprises a retro-inverso 
analogue of any of the inhibitory peptides or proteins disclosed herein, including C-peptides, N-peptides, 
Five-Helix proteins, Heptad Repeat 2 (HR2) peptides, and Heptad Repeat 1 (HR1) coiled-coil peptides. 
10 The inhibitor comprises a retro-inverso analogue of a sequence from coronavirus heptad repeat 1 or 
heptad repeat 2. 

According to an embodiment of the present invention, the coronavirus is a human coronavirus or 
a mouse coronavirus. According to a preferred embodiment of the present invention, the coronavirus is a 
human SARS coronavirus (SARS-CoV). 

15 According to an embodiment of the present invention, the inhibitor comprises a retro-inverso 

analogue of a sequence from coronavirus heptad repeat 2 (HR2); the sequence has a length of at least 20 
amino acid residues. According to a preferred embodiment of the present invention, the inhibitor is a 
retro-inverso analogue of HR2 peptide or C-peptide disclosed herein. The inhibitor comprises a retro- 
inverso analogue of the predicted HR2 region. The HR2 sequence has a length of from about 20 to about 

20 60 amino acid residues, or preferably from about 20 to about 50 amino acid residues. Amino acid 

residues present in these inhibitors may be a retro-inverso sequence of amino acid residues that occur 
sequentially in a coronavirus HR2 or may be amino acid residues that are separated by other (intervening) 
amino acid residues in the coronavirus HR2. 

Like corresponding HR2 inhibitors, the inhibitors can bind to the HR1 regions of the pre-hairpin 

25 intermediate in a dominant-negative manner to block formation of the hairpin structures, and thereby 
prevent the infection of coronavirus to the cells. 

Possible inhibitors include, but are not limited to, peptides comprising a retro-inverso analogue of 
the arnino acid sequences disclosed in sections 2-5 and peptides that have an amino acid sequence 
sufficiently similar to any of those sequences to function in the same manner. In particular embodiments, 

30 a inhibitor comprises a retro-inverso analogue of some or all of the amino acid sequence of SARS virus 
HR2, as disclosed in section 2. The SARS fusion inhibitors comprise from about 20 amino acid residues 
to about 60 amino acid residues. Amino acid residues present in these retro-inverso inhibitors may be a 
sequence of amino acid residues that occur consecutively (contiguously) in a coronavirus HR2 or may be 
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amino acid residues that are separated by other (intervening) amino acid residues noncontiguously) in the 
coronavirus HR2. 

The inhibitor can comprise a retro-inverso analogue of any of the sequences derived from SEQ 
ID NO:2, as disclosed in section 2. According to an embodiment of the present invention, the inhibitor 
5 can be any of the sequences derived from the retro-inverso analogue of SEQ ID NO:2, which is 

lwvywpwkiyqeykgleqldils Examples of such 

HR2 retro-inverso analogues are partial or full sequence of the retro-inverso analogue of SEQ ID NO:2. 
HR2 retro-inverso analogues can consist of amino acid residues from N to (N+M-l) of the analogue, 
wherein M is die length of the HR2 peptide, and N can be any integral number from 1 to (58-M). 
10 According to a further preferred embodiment of the present invention, the inhibitor comprises 

Idilsenlnkavenlrdiekciinwsanigsidgl, which is the retro-inverso analogue of SEQ ID 
NO:3. . 

According to an embodiment of the present invention, the fusion inhibitor of a coronavirus 
comprises a retro-inverso analogue of a sequence of at least 28 amino acid residues from coronavirus 

15 heptad repeat 1 (HR1). Like corresponding HR2 inhibitors, the retro-inverso inhibitors can bind to the 
HR1 regions of the pre-hairpin intermediate in a dominant-negative manner to block formation of the 
hairpin structures, and thereby prevent the infection of coronavirus to the cells. The inhibitor is retro- 
inverso HR1 peptide or N-peptide. The inhibitor can be the retro-inverso analogue of any of the HR1 or 
N-peptide inhibitors disclosed herein. 

20 According to an alternative embodiment of the present invention, the inhibitor comprises a retro- 

inverso analogue of soluble HR1-IQ peptides or HR1-IZ peptides, which are comprised of a soluble 
trimeric coiled coil fused to an HR1 peptide, which has the sequence of at least a portion of HR1 of 
coronavirus S2 protein. The peptides comprise a peptide of at least 15 amino acid residues from HR1 of a 
coronavirus fused in the proper heptad repeat register to a soluble trimeric coiled coil, such as GCN4-pIqI 

25 or XL. The HR1 region component of these peptides can be of varying length and will typically be from 
about 15 to about 50 amino acid residues from the coronavirus HR1 region. 

The inhibitor can be the retro-inverso analogue of any of the HR1 coiled-coil inhibitors disclosed 

herein. 

According to a further preferred embodiment of the present invention, the inhibitor comprises a 
30 peptide having a retro-inverso analogue of sequence selected from the group consisting of the partial or 
full sequence of SEQ ID NO: 16-28 . 
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8. Inhibitors Containing Tryptophan Analogs 

The present invention further relates to C-peptide and N-peptide coronavirus inhibitors containing 
one or more tryptophan analogs. 

A variety of different C-peptide and N-peptide inhibitors can be obtained based on the disclosure 
5 provided herein including the identification of HR1 and HR2 sequences, and the use of tryptophan 
analogs. Tryptophan analogs can be used to increase serum half-life and improve pharmacokinetic 
properties. Additional regions that may be present include additional coronavirus regions and other types 
of regions such as enhancer regions. Enhancer regions either substantially maintain or enhance either 
coronavirus inhibitor properties or pharmacokinetic properties. Examples of enhancer regions are 
10 provided in Barney et cd„ International Publication Number WO 01/03723, published January 18, 2001. 

8.1. Tryptophan Analogs 

Tryptophan analogs can be present in either D or L stereochemistry. Preferred tryptophan analogs 
are modified amino acids having an R group (D or L stereochemistry) independently selected from the 
15 group consisting of: 





22 



WO 20U5/0025U0 



PCT/US2004/011210 



wherein X is O, S, or NR 7 ; 
YisC-R 5 orN; 
WisNorCH; 
Z is CH or N; 

5 Ri, R 2 , and R 5 are independently selected from the group consisting of hydrogen, 

halogen, C r C 6 alkyl, Ci-C 6 alkylamino, amino, and carboxyl; 

R 3 and R4 are either (1) independently selected from the group consisting of hydrogen, 
halogen, Ci-C 6 alkyl, C r C 6 alkylamino, amino, and carboxyl; or (2) joined together to provide a fused 
benzene ring; 
10 R 6 is H or methyl; 

R 7 is H or linear, branched or cyclic C r C 6 alkyl; and 
nisOorl; 

further provided that in the case of formula I, Rj may also be CH 2 where there is a bond 
from this CH 2 to the alpha-NH of the modified amino acid. 
15 Specific examples of different tryptophan analogs are shown in Figures 4A, 4B and 4C. The 

tryptophan analogs are shown as amino acids. Amino acids are joined together in a peptide by peptide 
bonds. 

8.2. Additional HR Peptide Structures 
20 Preferably, enhancer regions are shorter length amino acid sequences up to about eight amino 

acids. In an embodiment of the present invention, the enhancer region at the N-terminus is XQEXEQK 

(SEQ ID NO:38), whereas the enhancer region at the C-terrninus is XPXYVXL (SEQ ID NO:39), where 

each X is independently either tryptophan or a tryptophan analog. 

According to an embodiment of the present invention, tryptophan analogs may be present in the 
25 HR2 based region or an additional amino acid region located at the N-terminus or C-terminus of the HR2 

based region. 

In an embodiment of the present invention the C-peptide consists of: 

(a) a HR2 based region comprising about 18 amino acids of SEQ ID NO: 2 optionally modified 
with one or more conservative substitutions thereof; 
30 (b) an optionally present N-terminus enhancer group joined to the N-terminus of the HR2 based 

region; 

(c) an optionally present C^errninus enhancer group joined to the C-terminus of the HR2 based 

region; 

provided that at least one tryptophan analog is present, 
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further provided that the C-peptide N-terminus, C-terminus, or both the N-terminus and C- 
terminus may contain a protecting group. 

In a more preferred embodiment a C-peptide consists of 

(a) a HR2 based region consisting of a sequence selected from the group consisting of: 

5 amino acid residues from N to (N+M-l) of SEQ ID NO:2, wherein M is the length of the HR2 

peptide, and N can be any integral number from 1 to (58-M). 

(b) an optionally present N-terminus enhancer group, preferably WQEWEQKI (SEQ ID NO:40), 
where one or both tryptophans may be replaced with a tryptophan analog; 

(c) an optionally present C-terminus enhancer group, preferably, WPWYVWL (SEQ ID NO:41) 
10 where one, two or all three tryptophans may be replaced with a tryptophan analog; 

provided that at least one tryptophan analog is present; and 

provided that the C-peptide N-terminus, C-terminus, or both the N-terminus and C-terminus may 
contain a protecting group. 

In a further preferred embodiment, According to a preferred embodiment of the present invention, 
15 the HR2 peptide has a sequence of Z-LGDISGINASWNIQKEIDRLNEVAKNLNESLIDLXPXYVXL 
(SEQ ID NO:42), and 

a variant thereof comprising one or more conservative substitutions; 

wherein Z is an optionally present enhancer group, wherein each X is independently either 
tryptophan or a tryptophan analog; 
20 provided that the C-peptide N-terminus, C-terrninus, or both the N-terminus and C-terminus may 

contain a protecting group. 

9. Inhibitors Containing Lactam Bridges 

The introduction of one or more lactam bridges into a C-peptide or N-peptide can be used to 
25 stabilize a conformation providing advantages over the corresponding structure not containing lactam 

bridges. Advantages that can be provided include one or more of the following; resistance to proteolytic 
enzyme, improved serum half-life, enhanced biological activity, and enhanced pharmacokinetic 
properties. The same enhanced stability, coupled with cross-reactivity of the antibody response between 
the linear and the corresponding cyclic constrained sequences, may make the peptides more suitable than 
30 their linear counterparts for generation of an immune response as part of a vaccine. 
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9.1. Lactam Bridges 

A lactam bridge is formed between a carboxyl moiety of a first amino acid and an amino moiety 
of a second amino acid. The carboxyl and amino moieties form an amide bond. 

The general structure of an individual amino acid and amino acids in a peptide (tripeptide with 
5 ionized amino and carboxyl groups) are as follows (where R is an amino acid R group): 



H 



OH 



H 2 N CH— C 



/ 



10 



V ft ? H H 

+ H 3 N- C— C-N— C— C-N-C-COO" 
| I II I 
R H H O R 



Introduction of lactam bridges into a particular peptide can make use of amino acids having 
suitable groups or involve modifying the peptide to insert one or more suitable amino acids. Suitable 
amino acids for forming a lactam bridge include naturally occurring amino acid and non-naturally 

15 occurring amino acids. 

Suitable amino acids are amino acid pairs where one amino acid contains a free amino and 
another contains a free carboxyl. Examples of amino acids containing an R-group carboxyl moiety 
include naturally occurring amino acids such as aspartic acid and glutamic acid, and non-naturally 
occurring amino acids such as 2-aminohexanedioic acid and 2-arninoheptanedioic acid. Examples of 

20 amino acids containing an R-group amino moiety include naturally occurring amino acids such as lysine 
and ornithine, and non-naturally occurring amino acids such as diaminobutyric acid and 
diaminopropanoic acid. 

Cyclization in an (i, i + 3), (i, i + 4) and (i, i + 7) spaced side chain to a side chain lactam bridge 
stabilizes turn conformations and helical segments. (Taylor Biopolymers (Peptide Science) 66:49-75, 
25 2002.) A peptide can be modified to contain amino acids with appropriate groups (carboxyl and amino 
pairs) suitably spaced apart. 

9.2. The Structures of HR Peptides Containing Lactam Bridges 

In an embodiment of the present invention, the C-peptide consists of a sequence selected from the 
30 group consisting of: any of the sequences derived from SEQ ID NO:2, as disclosed in section 2; 
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that is (a) modified to contain one or more lactam bridges each of which is independently in an (i, 
i + 3), (i, i + 4) or (i, i +7) orientation; (b) optionally modified to contain one or more conservative 
substitutions; (c) optionally modified to contain a C-terminus deletion; and (d) optionally modified to 
contain a N-terminus deletion, 

provided that the N-terminus, C-terminus, or both the N-terminus and C-terminus may contain a 
protecting group. 

Examples of such HR2 peptides are partial or full sequence of SEQ ID NO:2. HR2 retro-inverso 
analogues can consist of amino acid residues from N to (N+M-l) of SEQ ID NO:33, wherein M is the 
length of the HR2 peptide, and N can be any integral number from 1 to (58-M). 

Examples of peptides containing (i, 1+3), (i 7 i+4), or (i, i+7) lactam bridges are: 

(U+3) 

I (Lys) (Asp) 

P(Asp)VD(Lys)GDISGmASVVMQKEro 
I I 

(SEQ ID NO:43) 



(U+4) 

(Dab) (Glu) 
2. DVDLG(Glu)ISG(Dab)NASVVMQKEro 
I I 

(SEQIDNO:44) 
(U+7) 

(Glu) (Orn) 
3- VDLGDISGINA(Om)VVMQK(Glu)^ 



(SEQIDNO:45) 



10. The Production of the Inhibitors 

The inhibitors can be produced by a variety of methods, including by means of recombinant DNA 
or RNA methods and / or known chemical synthetic methods. They can also be obtained from a library 
of peptides, such as a combinatorial peptide library. They may be capped or uncapped with certain amino 
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acids. Recombinant versions can contain helix capping residues on the N- terminus, the C-terminus or 
both. Possible capping residues for the N-terminus are amino acid T, D and S and for the C-terminus 
include amino acid G, R, H, N, and K. 

The peptides may also contain a C-terminus cap (also referred to herein as a C-terminus 
5 "protecting group") or a N-terminus cap (also referred to herein as a N-terminus "protecting group**). A 
C-terminus protecting group is a modification to the C-terminal amino acid that reduces the reactivity of 
the carboxyl terminus under in vivo conditions. The carboxyl terminus is preferably modified at the a- 
carbonyl group. Examples of modification include chemical groups such as amide, methylamide, and 
ethylamide. 

10 An N^erminus protecting group is a modification to the N-tenninal amino acid that reduces the 

reactivity of the amino terminus under in vivo conditions. The N-terminal amino acid can be modified 
with groups such as acetyl, propionyl, succinyl, benzyl, benzyloxycarbonyl or f-butyloxycarbonyl. 

10.1. The chemical synthesis of the peptides 
1 5 According to an embodiment of the present invention, the peptides are produced using standard 

techniques including those involving step-wise chemical synthesis. Techniques for chemical synthesis of 
peptides are well known in the art. (See e.g., Vincent, in Peptide and Protein Drug Delivery, New York, 
N. Y, Dekker, 1990; E. Atherton & R. C. Sheppard, Solid Phase Peptide Synthesis. A Practical Approach. 
IRL Press, Oxford, England, 1989; J. Jones, The Chemical Synthesis of Peptides, Oxford University 
20 Press, New York, 1991; M. Bodanszky & A. Bodanszky, The Practice of Peptide Synthesis, 2 ud ed., 
Springer-Verlag, Berlin Heidelberg, 1994; Fields, G., Ed., Solid-Phase Peptide Synthesis. Methods 
Enzymol. 289, Academic press, New York, 1997). 

Tryptophan analogs can be produced and incorporated into a peptide taking into account well 
known techniques. Different tryptophan analogs can be obtained from commercial sources such as RSP 
25 Amino Acid Analogues Inc., Bachem, and Sigma; or based on techniques such as those described by 
Huber et al., Tetrahedron Letters 43:6129-6733, 2002, Fredrickson et al, European Journal of 
Pharmacology 455(1-2): 11 1-1 18, 2003, Mohammadi etal, Biochemistry 40(34): 1027 3-10283, 2001, 
Rajh et al, International Journal of Peptide & Protein Research 14(1 ):68-79, 1979, and Cady et al, 
Archives of Biochemistry and Biophysics 291 (2 J:326-333, 1991. 
30 Examples of techniques that can be used to produce and study peptides containing lactam bridges 

are provided by Taylor, Biopolymers (Peptide Science) 66:49-15, 2002 and Judice et al., Proc. Natl 
Acad Set USA 94:13426-13430, 1997. 

The retro-inverso peptides of the present invention may be chemically synthesized, using the 
same methods utilized for the synthesis of normal peptides. See, e.g., J. Biol Chem. 270:11921-11926 
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(1995), Briand et al., Proc. Natl. Acad. Sci. U.S.A. 94:12545-12550 (1997), Fletcher & Campbell, Chem. 
Rev. 98:763-795(1998), Levi et al. y Res. Human Retroviruses 16:59-65 (2000); Tayor et al, 2000; 
Pescarolo et al, FASEB J. 15:31-33 (2000); Chen et al, J. Med. Chem. 45 1624-1632 (2002); D'Ursi et 
aU J. Med. Chem. 46 1807-1810 (2003) 

5 

10.2 The production of Five-Helix proteins with recombinant DNA technologies 

The present invention further relates to DNA encoding Five-Helix; methods of producing Five- 
Helix. Five-Helix protein can be produced by a variety of methods. For example, it can be produced 
from a larger protein, such as 6-Helix, by enzymatic (trypsin) digestion. Alternatively, it can be 

10 produced, using known methods and expression systems, by expressing Five-Helix protein-encoding 
DNA, which can be a single DNA that encodes the entire Five-Helix protein or two or more DNA 
lt units", each of which encodes a portion (e.g., one or more HR1 component, one or more HR2 
components) of Five-Helix protein. 

The yield of expression and purification of Five-Helix can be significantly improved by direct 

15 expression of the Five-Helix gene in an appropriate host cell, such as E. coll In this approach, the Five- 
Helix gene encodes the residues present in the final Five-Helix protein. A C-terminal His-tag can be 
attached to facilitate purification (with or without a protease cleavage site to later remove the tag). The 
protein can then be used directly without the proteolytic cleavage and unfolding steps required for 
producing Five-Helix starting from Six-Helix. 

20 This Five-Helix molecule may be expressed as a folded active molecule, allowing its use in 

biological selections or screens for optimizing its properties. Alternatively, protein synthetic methods can 
be used to produce Five-Helix protein. The HR1 and HR2 components of Five-Helix can be joined 
covalently (such as by means of a linker of at least one (one or more) amino acid residues) or by other 
means which results in formation of a protein which is stable under physiological conditions and is 

25 correctly folded such that the remaining surface of Five Helix is presented so that it is available to bind or 
interact with coronavirus HR2. Li the embodiments in which there are three HR1 components and more 
than two (but less than three complete) HR2 components, the components can be similarly joined. 

Five-Helix can be assessed to determine if it acts on the virus before it enters the cell. Five-Helix 
should be soluble under physiological conditions and should also be stable under physiological 

30 conditions. 

It should also be possible to generate 5-Helix variants with an increased molecular weight (by 
oligomerization or tethering to a large protein) to reduce the rate of kidney clearance. In addition, Five- 
Helix dimers can be made by disulfide cros slinking, to produce a molecule filtered to a lesser extent than 
the Five-Helix "monomer". Thus, it is reasonable to expect that dimers might have an enhanced 
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bioavailability when compared to that of the C-peptides. Bioavailability of Five-Helix can be affected 
(enhanced) by producing variants in nonbacterial cells, in which the protein would be glycosylated as it is 
produced. Alternatively, glycosylation sites can be engineered into Five-Helix. 

Another embodiment of this invention is a complex of Five-Helix tethered (with an appropriate 
linker, which can be made of amino acid residues, or can be a chemical or synthetic linker) to a 
component, such as a ligand (protein or small molecule) that binds coronavirus envelope protein. The 
complex is useful as a therapeutic agent, because the ligand will increase die effective concentration of 
the tethered Five-Helix moiety in the vicinity of the coronavirus envelope protein. 

1 1. Utilities of the Coronavirus Inhibitors 

The present invention relates to methods in which the peptides and Five-Helix proteins are used. 
The peptides and Five-Helix are useful as therapeutic agents or drugs to prevent coronavirus infection or 
reduce the extent to which it occurs; as a reagent for identifying (screening for) or designing other 
therapeutics or prophylactics effective against coronavirus infection. 

In one embodiment of the present invention, the peptides and Five-Helix are used to reduce 
coronavirus infection in an individual, m this embodiment, the peptides or Five-Helix is adrninistered to 
an individual in sufficient quantity to reduce (totally or partially) infection of the individual's cells. That 
is, a dose of the peptides or Five-Helix sufficient to reduce coronavirus infection (an effective dose) is 
administered in such a manner (e.g., by injection, topical administration, intranasal route, intravenous 
route) that it inhibits (totally or partially) coronavirus entry into cells. The peptides or Five-Helix can be 
aaministered to an individual who is infected, to reduce further infection, or to an uninfected individual to 
prevent infection or reduce the extent to which infection occurs. The Five-Helix can also be administered 
via expression of Five-Helix-encoding DNA in appropriate host cells or vectors. 

The present invention provides a method of inhibiting infection of human cells by coronavirus in 
an individual, comprising administering to the individual a pharmaceutical composition comprising the 
retro-inverso analogue of a SARS-CoV HR peptide in sufficient quantity to inhibit infection of human 
cells by the S ARS virus, and a pharmaceutical acceptable carrier, wherein infection of human cells is 
inhibited. 

11.1 Administration 

HR-peptide coronavirus inhibitors can be formulated and administered to a subject using the 
guidance provided herein along with techniques well known in the art. The preferred route of 
administration ensures that an effective amount of compound reaches the target. Guidelines for 
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pharmaceutical administration in general are provided in, for example, Remington's Pharmaceutical 
Sciences 18 th Edition, Ed. Gennaro, Mack Publishing, 1990, and Modern Pharmaceutics 2 tld Edition, Eds. 
Banker and Rhodes, Marcel Dekker, Inc., 1990. 

HR-peptide coronavirus inhibitors can be prepared as acidic or basic salts. Pharmaceutically 
5 acceptable salts (in the form of water- or oil-soluble or dispersible products) include conventional non- 
toxic salts or the quaternary ammonium salts that are formed, e.g., from inorganic or organic acids or 
bases. Examples of such salts include acid addition salts such as acetate, adipate, alginate, aspartate, 
benzoate, benzenesulfonate, bisulfate, butyrate, citrate, camphorate, camphorsulfonate, 
cyclopentanepropionate, digluconate, dodecylsulfate, ethanesulfonate, fumarate, glucoheptanoate, 

10 glycerophosphate, hemisulfate, heptanoate, hexanoate, hydrochloride, hydrobromide, hydroiodide, 2- 

hydroxyethanesulfonate, lactate, maleate, methanesulfonate, 2-naphthalenesulfonate, nicotinate, oxalate, 
pamoate, pectinate, persulfate, 3-phenylpropionate, picrate, pivalate, propionate, succinate, tartrate, 
thiocyanate, tosylate, and undecanoate; and base salts such as ammonium salts, alkali metal salts such as 
sodium and potassium salts, alkaline earth metal salts such as calcium and magnesium salts, salts with 

15 organic bases such as dicyclohexylamine salts, N-methyl-D-glucamine, and salts with amino acids such 
as arginine and lysine. 

HR-peptide coronavirus inhibitors can be administered using different routes such as an 
intravenous route, an intraperitoneal route, a subcutaneous route, a topical route, an intramuscular route, 
or an intranasal route. When administered by injection, the injectable solution or suspension may be 
20 formulated using suitable non-toxic, parenteraily-acceptable diluents or solvents, such as Ringer's solution 
or isotonic sodium chloride solution, or suitable dispersing or wetting and suspending agents, such as 
sterile, bland, fixed oils, including synthetic mono- or diglycerides, and fatty acids, including oleic acid. 

Pharmaceutically acceptable carriers facilitate storage and administration of a HR-peptide to a 
patient. Pharmaceutically acceptable carriers may contain different components such as a buffer, sterile 
25 water for injection, normal saline or phosphate buffered saline, sucrose, histidine, salts and polysorbate. 

Suitable dosing regimens are preferably determined taking into account factors well known in the 
art including type of subject being dosed; age, weight, sex and medical condition of the subject; the route 
of administration; the renal and hepatic function of the subject; the desired effect; and the particular 
compound employed. 

30 Optimal precision in achieving concentrations of drug within the range that yields efficacy 

without toxicity requires a regimen based on the kinetics of the drug's availability to target sites. This 
involves a consideration of the distribution, equilibrium, and elimination of a drug. The daily dose for a 
subject is expected to be between 0.01 and 1,000 mg per subject per day. 
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HR-peptide coronavirus inhibitors can be provided in kit. Such a kit typically contains an active 
compound in dosage forms for administration. A dosage form contains a sufficient amount of active 
compound such that a beneficial effect can be obtained when administered to a patient during regular 
intervals, such as 1 to 6 times a day, during the course of 1 or more days. Preferably, a kit contains 
5 instructions indicating the use of the dosage form for treating coronavirus and the amount of dosage form 
to be taken over a specified time period. 

11.2. Immunogens 

HR-peptides providing appropriate epitopes may be used as an immunogen for anti-coronavirus 

10 antibody production. The immunogen can be used, for example, as a vaccine component or to produce 
monoclonal or polyclonal antibodies that are subsequently isolated and used to target the coronavirus 
Spike protein. In an embodiment, the invention relates to a method of eliciting an immune response to 
coronavirus in an individual, comprising introducing, by an appropriate route, a composition comprising 
Five-Helix and a physiologically acceptable carrier, in a dose sufficient to elicit the immune response in 

15 the individual. Vaccines comprising the peptides or Five-Helix (or a variant or portion thereof) in a 
physiologically acceptable carrier are the subject of this invention. 

Immunogens can be formulated as a vaccine or used for antibody production and isolation taking 
into account the guidance provided herein along with techniques well known in the art. Techniques for 
producing and using antibodies are well known in the art. Examples of such techniques are described in 

20 Ausubel, Current Protocols in Molecular Biology, John Wiley, 1987-1998, Harlow et al„ Antibodies, A 
Laboratory Manual, Cold Spring Harbor Laboratory, 1988, and Kohler et al, Nature 256:495-497, 1975. 

Guidelines for pharmaceutical vaccine administration in general are provided in, for example, 
Vaccines Eds. Plotkin and Orenstein, W.B. Sanders Company, 1999; Remington's Pharmaceutical 
Sciences 2Cf h Edition, Ed. Gennaro, Mack Publishing, 2000; and Modern Pharmaceutics 2 nd Edition, Eds. 

25 Banker and Rhodes, Marcel Dekker, Inc., 1990. 

A vaccine can be administered in conjunction with an adjuvant. Adjuvants are substances that 
can assist an immunogen in producing an immune response. Adjuvants can function by different 
mechanisms such as one or more of the following: increasing the antigen biologic or immunologic half- 
life; improving antigen delivery to antigen-presenting cells; improving antigen processing and 

30 presentation by antigen-presenting cells; and inducing production of immunomodulatory cytokines. 
(Vogel, Clinical Infectious Diseases 30 (suppl. 3):S266-270, 2000.) 

A variety of different types of adjuvants can be employed to assist in the production of an 
immune response. Examples of particular adjuvants include aluminum hydroxide, aluminum phosphate, 
other salts of aluminum, calcium phosphate, DNA CpG motifs, monophosphoryl lipid A, cholera toxin, E. 
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coli heat-labile toxin, pertussis toxin, muramyl dipeptide, Freund's incomplete adjuvant, MF59, SAF, 
immunostimulatory complexes, liposomes, biodegradable microspheres, saponins, nonionic block 
copolymers, muramyl peptide analogues, polyphsophazene, synthetic polynucleotides, IFN-y, IL-2 and 
IL-12. (Vogel Clinical Infectious Diseases 30(suppl 3):S266-270, 2000, Klein et al„ Journal of 
5 Pharmaceutical Sciences 89, 311-321, 2000.) 

Vaccines can be administered by different routes such as subcutaneous, intramuscular, or 
mucosal. Subcutaneous and intramuscular administration can be performed using, for example, needles 
or jet-injectors. 

Suitable dosing regimens are preferably determined taking into account factors well known in the 
10 art including age, weight, sex and medical condition of the patient; the route of administration; the desired 
effect; and the particular compound employed. The immunogen can be used in multi-dose vaccine 
formats. It is expected that a dose would consist of the range of 1 jag to 1.0 mg total polypeptide, in an 
embodiment of the present invention the range is 0.1 mg to 1.0 mg. 

The timing of doses depends upon factors well known in the art. After the initial administration 
15 one or more booster doses may subsequently be administered to maintain or boost antibody titers. An 
example of a dosing regime would be day 1, 1 month, a third dose at either 4, 6 or 12 months, and 
. additional booster doses at distant times as needed. 

The retro-inverso peptides can also be used as vaccines against SARS. Several studies have 
documented that the retro-inverso analogs of linear peptide epitopes can be useful as vaccines (Guichard 
20 etaU Proc. Natl Acad. Set. U.S.A. 91:9765-9769 (1994); Benkirane et aU /• Biol Chenu 270:11921- 
1 1926 (1995); Briand et al, Proc. Natl Acad. Set U.S.A. 94: 12545-12550 (1997); Meziere et al, 1997; 
Muller & Briand, Res. Immunol. 149:55-57 (1998)). More specifically it was shown for Foot-and-Mouse 
disease virus (FMDV) that (i) antibodies raised against the natural epitope recognize the retro-inverso 
peptide; (ii) the retro-inverso peptide is able to generate in rabbits an immune response of much higher 
25 titer, and longer duration, than the corresponding linear peptide; (iii) the antibodies generated by 

immunization with the retro-inverso peptide are cross reactive with FMDV viral particles, and (iv) the 
immune response generated in guinea pigs by immunization with the retro-inverso peptide is protective 
against FMDV challenge (Briand et al, Proc. Natl Acad. Sci. U.S.A. 94:12545-12550 (1997)). 

Therefore retro-inverso peptides, corresponding to the sequence of the Heptad Repeats 1 and 2 of 
30 the SARS coronavirus, can be used, either as such or, more desirably, conjugated to a protein carrier like 
OMPC, for the induction of a protective immune response against the virus. 
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11.3. Other Utilities 

la a specific embodiment, the invention relates to a method of identifying a compound or 
molecule that binds the peptides or Five-Helix and inhibits coronavirus infection of mammalian cells. 
The compound or molecule to be assessed is referred to as a candidate inhibitor. The method comprises 
5 combining a candidate inhibitor and the peptides or Five-Helix under appropriate binding conditions, and 
determining and selecting the compound or molecule that binds the peptides or Five-Helix. The method 
optionally further comprises determining and selecting the selected compound or molecule that inhibits 
coronavirus infection of rnammalian (e.g., human) cells, such as in a cell-based assay. Such a compound 
or molecule may be used to inhibit (totally or partially) coronavirus infection of human or non-human 
10 animals. 

Other embodiments are within the following claims. While several embodiments have been 
shown and described, various modifications may be made without departing from the spirit and scope of 
the present invention. 

15 EXAMPLES 

Examples are provided below to further illustrate different features of the present invention. The 
examples also illustrate useful methodology for practicing the invention. These examples do not limit the 
claimed invention. 



20 Example 1: The Assays Used to Test the Peptide Activity 

The identification of SARS coronavirus HR1 andHR2 sequences provides substrates that can be 
employed to evaluate the ability of compound to inhibit HR1 and HR2 interactions. Compounds that can 
be evaluated include C-peptides and N-peptides. 

Different types of assay formats can be employed to measure the ability of a compound to inhibit 
25 HR1 and HR2 interactions. Detectable labels on either or both HR1 and HR2 can be used to help 

measure HR1 and HR2 interactions. Examples of different types of labels include fluorescent labels, 
radioactive labels, and donor-acceptor pairs (e.g., FRET labels). 

Peptide activity can also be measured by providing a peptide to a cell infected with the 
coronavirus. Using techniques well known in the art, different types of assays and endpoints can be 
30 measured to evaluate peptide activity. For example, an assay can be performed to measure a compound' s 
ability to prevent virus infection and cell-to-cell spread over a 72-hour incubation; where antiviral activity 
is assayed by measuring retention of the cells' ability to phagocytose neutral red. An example of such as 
assay (based on USAMRHD assay) is as follows: 
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1) Incubate cells with different concentrations of test peptide (may be done in triplicate). The 
different concentrations may be serial dilutions and should include a control with no peptide. Peptide 
dilutions can be performed in a suitable buffer (e.g., EMEM, 1% fetal bovine serum, 1.0% L-glutarnine 
(100 X), 1.0% HEPES, 0.1% Funigzone (250 ug/ 20 ml), 0.1% Gentamicin (50 mg/ml)) added to a 96- 

5 well microliter plate of confluent Vero 76 cells (VERO 76, ATCC number CRL-1587). At different drug 
concentrations wells are inoculated with SARS associated coronavirus (e.g., MOI of 0.01) or are not 
infected. 

2) Plates are incubated at 370C, 5% CO2 and examined daily until a desired cytophathic effect is 
seen in untreated cells (e.g., 4 +CPE). Cell viability can be assessed by measuring neutral red uptake. 

10 For example, neutral red can be added to give a final concentration of 0.22 mg/ml and the cells are 
incubated at 37°C for 1.5 hours. The medium is then removed; wells are rinsed three times with 
phosphate buffered saline and fixed in 10% buffered formalin for 15 minutes. Retained neutral red is 
solubilized with 100 ui of a 50% ethanol, 50% 0.01 M NH4H2PO4 (pH 3.5). Plates can then be rocked 
for 30 minutes at 150 RPM to liberate neutral red and the optical density (at 450 run) can be measured 

1 5 with a plate reading spectrometer. 

Other embodiments are within the following claims. While several embodiments have been 
shown and described, various modifications may be made without departing from the spirit and scope of 
the present invention. 

20 Example 2: An assay based on the pseudotyping of HIV cores with SARS-CoV Spike protein. 

The present invention also provides a SARS-spike dependent entry assay useful for screening 
inhibitors of SARS-CoV entry and fusion. In this assay, cells transiently or stably expressing angiotensin 
converting enzyme-2 (ACE-2) are infected with HIV-1 Luciferase reporter viruses pseudoryped with 
SARS-CoV Spike protein. 

25 HIV DNA that has a deletion in its envelope sequence and containing firefly luciferase in place of 

the Nef gene is co-transfected with a mammalian expression vector encoding codon optimized SARS- 
CoV Spike protein. A plasmid DNA encoding the NL4-3 viral clone, which contains a deletion in its 
envelope sequence and containing firefly luciferase in place of the Nef gene is co-transfected in 293T 
cells (a human embryonic kidney cell line stably expressing the Simian Virus 40 Large T antigen) with a 

30 mammalian expression vector encoding a codon optimized SARS-CoV Spike protein (See, e.g., 
Simmons, G., et aL, PNAS 101:4240-5 (2004)). 

The transfected cells produce in their supernatants HIV cores that are pseudotyped with SARS 
Spike protein. These reporter viruses are capable of entering 293T cells transiently or stably transfected 
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with expression vectors encoding the angiotensin converting enzyme-2 (ACE-2), a cellular receptor for 
SARS-CoV (Li, W.» et aL, Nature 426:450-4 (2003); Wong, S.K., et aL, Journal of Biological Chemistry 
279:3197-201 (2004)). 

Infection is monitored by measuring Firefly luciferase activity in cell lysates at 48-72h post- 
infection. The system offers a dynamic range (30-100 fold above infection with particles lacking virus 
envelopes). Entry through the ACE-2 receptor has been validated in both in transiently and stably 
transfected cells by demonstrating that particle entry is blocked in a dose-dependent fashion with 
antibodies directed to ACE-2. 
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WHAT IS CLAIMED: 
What is claimed is: 

1. A coronavirus inhibitor which is selected from the group consisting of 

a) a Five-Helix protein, 

b) an HR2 peptide, 

c) an HR1 peptide, 

d) an HR1 chimeric peptide, 

e) a D-peptide which is the retro-inverso analogue of b), c), or d); and 

f) a modified version of b), c), or d), containing one or more modifications selected from the 
group consisting of i) one or more tryptophan analogs, ii) one or more lactam bridges, and Hi) one or 
more protecting groups. 

2. The inhibitor of claim 1 which is derived from S protein of human SARS-CoV. 

3. The inhibitor of claim 2 further comprising a capping residue on the N-terminus, the C-terminus 
or both. 

4. The inhibitor of claim 3 wherein the capping residue on the N-terminus is selected from the group 
consisting of T, D and S and the capping residue on the C-terminus is selected from the group consisting 
of G, R, H, N, andK. 

5. The inhibitor of claim 2 further comprising an arnino-terminal cap. 

6. The inhibitor of claim 5 wherein the arnino-terminal cap is acetyl or succiriimide. 

7. A method of identifying a drug candidate that inhibits coronavirus infection, comprising 
combining a compound and the inhibitor of claim 1, under conditions appropriate for binding of the 
compound and the inhibitor to occur, and determining if binding occurs. 

8. The method of claim 7 further comprising determining if the compound that binds the inhibitor 
inhibits coronavirus infection of mammalian cells in a cell-based assay. 

9. A method of inhibiting coronavirus infection of cells in an individual, comprising administering 
to the individual the inhibitor of claim 1 in sufficient quantity and by an appropriate route, whereby 
infection of cells in the individual is inhibited. 

10. The inhibitor of claim 1 which is a Five-Helix protein being soluble under physiological 
conditions and comprising three heptad repeat 1 components of coronavirus S2 protein and at least two, 
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but not three complete, heptad repeat 2 components of coronavirus S2 protein, wherein the components 
are separated by linkers. 

1 1 . The inhibitor of claim 1 wherein the Five-Helix protein has an amino acid sequence selected from 
the group consisting of: 

5 the sequence of residues 1-243 of SEQ ID NO:34, 

the sequence of residues 1-237 of SEQ ID NO;35, 
the sequence of residues 1-237 of SEQ ID NO:36, and 
the sequence of residues 1-317 of SEQ ED NO:37. 

12. The inhibitor of claim 1 which is an HR2 peptide comprising at least 20 armno acid residues from 
10 coronavirus HR2 and is an inhibitor of coronavirus infection of eukaryotic cells. 

13. The inhibitor of claim 12 wherein the HR2 peptide has a sequence selected from the group 
consisting of: amino acid residues 5-50, 9-41, 21-57, 5-41, 30-57, and 1-57 of SEQ ID NO:2, and 1 148- 
1182 and 1148-1 185 of SEQ ID NO:l. 

14. The inhibitor of claim 1 which is an HR1 chimeric peptide having an amino acid sequence 
15 selected from the group consisting of: 

a peptide consisting of SEQ ID NO: 16, or residues 2-63 of SEQ ID NO: 16; 

a peptide consisting of SEQ ID NO: 17, or residues 2-67 of SEQ ID NO: 17; 

a peptide consisting of SEQ ID NO: 18; 

a peptide consisting of SEQ ID NO: 19; 
20 a peptide consisting of SEQ ID NO:20; 

a peptide consisting of SEQ ID NO:21, or residues 2-56 of SEQ ID NO:20; 

a peptide consisting of SEQ ID NO:22, or residues 2-49 of SEQ ID NO:21; 

a peptide consisting of SEQ ID NO: 23, or residues 2-53 of SEQ ID NO:22; 

a peptide consisting of SEQ ED NO:24, or residues 2-67 of SEQ ID NO:23; 
25 a peptide consisting of SEQ JD NO:25, or residues 2-63 of SEQ ID NO:24; 

a peptide consisting of SEQ ID NO: 26; 

a peptide consisting of SEQ ID NO:27; 

a peptide consisting of SEQ ID NO:28, or residues 1-49 of SEQ ID NO:25; 
a peptide consisting of SEQ ID NO:29, or residues 1-49 of SEQ ID NO:26; 
30 a peptide consisting of SEQ ED NO:30; 

a peptide consisting of SEQ ID NO:31; and 
a peptide consisting of SEQ ID NO: 32. 
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15. The inhibitor of claim 1 which is an HR1 peptide comprising about 28 amino acid residues from 
HR1 of a coronavirus. 

16. The inhibitor of claim 15 wherein the HR1 peptide have an amino acid sequence comprising 
amino acid residues from about amino acid residue 900 to amino acid residue 974 of the sequence of SEQ 

5 ID NO: 1 or amino acid residues from about 889 to about 1005 of the sequence of SEQ ID NO: 1. 

17. The inhibitor of claim 16 wherein the amino acid sequence is selected from the group consisting 
of sequences of residues 896-972, 900-938, 914-949, 922-956, 943-980, and 943-981 of SEQ ID NO:l. 

18. The inhibitor of claim 1 which is the retro-inverso analogue of an HR2 peptide, an HR1 peptide, 
or an HR1 chimeric peptide. 

10 19. The inhibitor of claim 1 comprising, 

an HR region selected from the HR1 or HR2 sequence of coronavirus, optionally modified with 
one or more conservative substitutions, and 
one or more tryptophan analogs, 

wherein said one or more tryptophan analogs may be present in said HR based region or an 
15 additional amino acid region located at the N-terminus or C-terminus of said HR based region. 

20. The peptide of claim 19, wherein each tryptophan analog is a modified amino acid having an R 
group independently selected from the group consisting of: 
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in iv 

wherein X is O, S, or NR 7 ; 
Yis C-R 5 orN; 
5 "W is N or CH; 

ZisCHorN; 

R b R 2 , and R 5 are independently selected from the group consisting of hydrogen, halogen, Q-Q 
alkyl, C r C 6 alkylamino, amino, and carboxyl; 

R 3 and R4 are either (1) independently selected from the group consisting of hydrogen, halogen, 
10 C r C 6 alkyl, C r C 6 alkylamino, amino, and carboxyl; or (2) joined together to provide a fused benzene 
ring; 

R$ is H or methyl; 

R 7 is H or linear, branched or cyclic C r C 6 alkyl; and 
nisOor 1; 

15 further provided that in the case of formula I, Ri may also be CH 2 where there is a bond from this 

CH 2 to the alpha-NH of said modified amino acid. 

21. The peptide of claim 20, wherein said peptide consists of: 
said HR region, 

an optionally present N-terminus enhancer group joined to the N-terminus of said HR region, and 
20 an optionally present C-tenninus enhancer group joined to the C-terminus of said HR region, 

provided that at least one tryptophan analog is present, 

further provided that said N-terminus, said C-terminus, or both said N-terminus and C-terminus 
of said peptide may contain a protecting group. 

22. The peptide of claim 21, wherein said N-terminus enhancer group is XQEXEQK (SEQ ID 
25 NO:38), wherein each X independently is either tryptophan or tryptophan analog. 
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23. The peptide of claim 21, wherein said C-terrninus enhancer group is XPXYVXL (SEQ ID 
NO:39), or residues 1, 1-2, 1-3, 1-4, 1-5, or 1-6 of SEQ ID NO:39, wherein each X 
independently is either tryptophan or tryptophan analog. 

24. The peptides of claims 22-23, wherein each X R group is independently selected from the group 
5 consisting of: 




o=(° H 



Trp(5-CH 2 NH 2 ) 



neo-Trp 



Trp(5-C0 2 H) 



CH 3 



10 Beta-MeTrp 



H 



nor-Trp 



Of 

N 
H 

Trp(5-CH 2 CH 2 NH 2 ) 




J 



HN 



Or 



(l-Benzimidazolonyl)alanine Trp(4,5,6,7-tetrafluoro) 
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H 2 N 




H 



Ala((3-(3-Quinolinyl)) Benzo-Trp Trp(5-NH 2 ) 




Trp(5-F) Trp(5-OH) 1-naphthylalanine 




5 2-naphthyl alanine beta-(3-Benzothienyl)Ala 7-azaTrp 




4-MeTrp 4-F-Tip 7-MeTip 




CH 3 



10 1-MeTrp beta-(3-Benzofuranyl)Ala . or said modified „^ no acid is 
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H 



.OH 



O 



l,2,3,4-tetrahydro-9H-pyrido(3,4-b)indole-3-carboxylic acid. 

25. The inhibitor of claim 1 wherein the inhibitor has an HR region selected from the HR1 or 
HR2 sequence of coronavirus, optionally modified with one or more conservative substitutions, and 

5 is: (a) modified to contain one or more lactam bridges each of which is independently in an (i, i + 
3), (i, i + 4) or (i, i +7) orientation; and (b) optionally modified to contain one or more 
conservative substitutions. 

26. The peptide of claim 25 wherein N-terminus, C-terminus, or both the N-tenninus and C- 
tenninus of the peptide contain a protecting group. 

10 27. The peptide of claim 25, wherein each of said lactam bridges is independently formed between: 
(a) an amino acid providing a carboxyl moiety, selected from the group consisting of aspartic acid, 
glutamic acid, 2-aminohexanedioic acid and 2-aminoheptanedioic acid; and (b) an amino acid providing 
an amino moiety, selected from the group consisting of lysine, ornithine, diaminobutyric acid, and 
diaminopropanoic acid. 
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1 MFIFLLFLTLTSGSDLDRCTTFDDVQAPNYTQHTSSMRGVYYPDEIFRSD 

5 1 TLYLTQDLFLPFYS3WTGFHTINHTFGNPVIPFKDGIYFAATEKSNWRG 

101 WVFGSTMJSDSTKSQSVIIINNSTISIWIR^ 

151 M I FDNAFNCTF EY I SDAF SLDVS EKS GNFKHLREF VFKNKDGFLYVYKG Y 

201 QPIDVVRDLPSGFNTLKPIFKLPL.GINITNFRAILTAFSPAQDIWGTSAA 

251 AYFVGYLKPTTFMLKYDENGTITDAVDCSQNPLAELKCSVKSFEIDKGIY 

301 QTSNFRWPSGDWRFPNITNLC PFGEVFNATKF PSVYAWERKK I SNCVA 

351 DYSVLYNSTFFSTFKCYGVSATKLOT)L^ 

401 QTGVIADYNYKLPDDFMGCVXjAWN^ 

451 FERDISNVPFSPDGKPCTPPALNCYWPLNDYGFYTTTGIGYQPYRVWLS 

501 FELLNAPATVCGPKLSTDLIKNQCVNFNFNGLTGTGVLTPSSKRFQPFQQ 

551 FGRDVSDFTDSVRDPKTSEILiDISPCAFGGVSVITPGTNASSEVAVLYQD 

601 WCTDVSTAIHADQLTPAWRIYSTGNNVFQTQAGCLIGAEHVDTSYECDI 

651 PIGAGICASYHTVSLLRSTSQKS IVAYTMSLGADSS IAYSNNTIAIPTNF 

701 S I S ITTEVMPVSMAKTSVDCJSnXT^ ICGDSTECANLLLQYGSFCTQLNRTUjS 

751 GIAAEQDRNTREVFAQVKQMYKTPTLiKYFGGFNFSQIIiPDPLKPTKRSFI 

801 EDLLFNKVTLADAGFMKQYGECLGDINARDLICAQKFNGLTVLPPIiLTDD 

851 MI AAYTAALVSGTATAGWTFGAGAALiQ I P F AMQMA YRFNG I G VT ONVL YE 

901 NQKQIANQFNKAISQIQESLTTTSTAIiGKIfQDVVNQNAQAIiNTLVKQIiSS 

951 NFGAI SSVIiNDILSRLDKVEAEVQI DRL I TGRLQ SLQTYVTQQL IRAAE I 

1001 RASANL AATKMS E C VLGQ S KRVDF CGKG YHLMS F PQ AAPHG WF LHVT YV 

1051 PSQERNFTTAPAICHEGKAYFPREGVFVFNGTSWFITQR3SFFFSPQIITTD 

1101 NTFVSGNCDWIGI INNTVYDPLQPELDSFKEELDKYFKNHTS PDVDLGD 

1151 ISGII^SVVTJIQKEIDRLNEVAKNLNESIjIDIiQELG KYEQYIKW 

1201 GFIAGLIAIVMVTILLCCMTSCCSCLKGACSCGSCCKFDEDDSEPVLKGV 

1251 KLHYT 
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a' d' a' d' a' d' a' d' a' d' 
SARS-HR2 ISGINASVVNIQKEIDRLNEVAKNIiNESLIDIjQEL (SEQ ID NO : 4 ) 

fb fb fb fb fb 
GCN4 RMKQLEDKVEELLSKNYHLENEVARLKKLVGER (SEQ ID NO : 5 ) 



(Cys)HR2G Ac-CGGrMKINEDVVEIQLSINYLNENAVAIiNKKLVGLQ-NH2 (SEQ ID NO: 6) 
(Cys)GCN4 Ac-CGGimKQLEDKVEELLSKNYHLENEVARLKKLVGER (SEQ ID NO: 7) 
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Beta-MeTrp 
H 2 N 

N NH * 
H 

Trp(5-CH 2 CH 2 NH 2 ) 




Trp(4,5,6,7-tetrafluoro) 
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Figure 4B 
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SEQUENCE LISTING 

<110> Merck & Co., Inc. 

Istituto Di Ricerche Di Biologia Molecolare P. Angeletti S.P.A. 

<120> INHIBITORS OF CORONAVIRUS 



<130> 21640Y 

<150> 60/479430 
<151> 2003-06-18 

<150> 60/479429 
<151> 2003-06-18 

<150> 60/478860 
<151> 2003-06-18 



<150> 60/467190 
<151> 2003-04-30 

<150> 60/463100 
<151> 2003-04-14 

<160> 45 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 1255 
<212> PRT 
<213> SARS-CoV 



<400> 1 



Met Phe 


He 


Phe 


Leu 


Leu 


Phe 


Leu 


Thr 


Leu 


Thr 


Ser Gly 


Ser 


Asp 


Leu 


1 






5 










10 










15 




Asp Arg 


Cys 


Thr 
20 


Thr 


Phe 


Asp 


Asp 


Val 
25 


Gin 


Ala 


Pro 


Asn 


Tyr 
30 


Thr 


Gin 


His Thr 


Ser 
35 


Ser 


Met 


Arg 


Gly 


Val 
40 


Tyr 


Tyr 


Pro 


Asp 


Glu 
45 


He 


Phe 


Arg 


Ser Asp 


Thr 


Leu 


Tyr 


Leu 


Thr 


Gin 
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Phe 


Leu 
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Phe 


Tyr 


Ser 


50 
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Asn Val 


Thr 


Gly 


Phe 


His 


Thr 


He 


Asn 


His 


Thr 


Phe 


Gly Asn 


Pro 


Val 


65 






70 










75 










80 


He Pro 


Phe 


Lys 


Asp 
85 


Gly 


He 


Tyr 


Phe 


Ala 
90 


Ala 


Thr 


Glu 


Lys 


Ser 
95 


Asn 


Val Val 


Arg 


Gly 
100 


Trp 


Val 


Phe 


Gly 


Ser 
105 


Thr 


Met 


Asn 


Asn 


Lys 
110 


Ser 


Gin 


Ser Val 


He 
115 


He 


He 


Asn 


Asn 


Ser 
120 


Thr 


Asn 


Val 


Val 


He 
125 


Arg 


Ala 


Cys 


Asn Phe 


Glu 


Leu 


Cys 


Asp 


Asn 


Pro 


Phe 


Phe 


Ala 


Val 


Ser 


Lys 


Pro 


Met 


130 








135 










140 










Gly Thr 


Gin 


Thr 


His 


Thr 


Met 


He 


Phe 


Asp 


Asn 


Ala 


Phe 


Asn 


Cys 


Thr 


145 








150 








155 










160 


Phe Glu 


Tyr 


He 


Ser 
165 


Asp 


Ala 


Phe 


Ser 


Leu 
170 


Asp 


Val 


Ser 


Glu 


Lys 
175 


Ser 


Gly Asn 


Phe 


Lys 
180 


His 


Leu 


Arg 


Glu 


Phe 
185 


Val 


Phe 


Lys 


Asn 


Lys 
190 


Asp 


Gly 


Phe Leu 


Tyr 
195 


Val 


Tyr 


Lys 


Gly 


Tyr 
200 


Gin 


Pro 


He 


Asp 


Val 
205 


Val 


Arg 


Asp 



-1- 
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Leu 


Pro 


Ser 


Gly 


Phe 


Asn 


Thr 


Leu 


Lys 


Pro 


He 


Phe 


Lys 


Leu 


Pro 


Leu 




210 










215 








220 








Gly 


lie 


Asn 


He 


Thr 


Asn 


Phe 


Arg 


Ala 


He 


Leu 


Thr 


Ala 


Phe 


Ser 


Pro 


225 










230 








235 










240 


Ala 


Gin 


Asp 


He 


Trp 
245 


Gly 


Thr 


Ser 


Ala 


Ala 
250 


Ala 


Tyr 


Phe 


Val 


Gly 
255 


Tyr 


Leu 


Lys 


Pro 


Thr 


Thr 


Phe 


Met 


Leu 


Lys 


Tyr Asp 


Glu 


Asn 


Gly 


Thr 


He 






260 










265 










270 






Thr 


Asp 


Ala 


Val 


Asp 


Cys 


Ser 


Gin 


Asn 


Pro 


Leu 


Ala 


Glu 


Leu 


Lys 


Cys 




275 










280 










285 








Ser 


Val 


Lys 


Ser 


Phe 


Glu 


He 


Asp 


Lys 


Gly He 


Tyr 


Gin 


Thr 


Ser 


Asn 




290 










295 










300 










Phe 


Arg 


Val 


Val 


Pro 


Ser 


Gly 


Asp 


Val 


Val 


Arg 


Phe 


Pro 


Asn 


He 


Thr 


305 








310 








315 










320 


Asn 


Leu 


Cys 


Pro 


Phe 


Gly 


Glu 


Val 


Phe 


Asn 


Ala 


Thr 


Lys 


Phe 


Pro 


Ser 








325 










330 










335 




Val 


Tyr 


Ala 


Trp 
340 


Glu 


Arg 


Lys 


Lys 


He 
345 


Ser 


Asn 


Cys 


Val 


Ala 
350 


Asp 


Tyr 


Ser 


Val 


Leu 
355 


Tyr 


Asn 


Ser 


Thr 


Phe 
360 


Phe 


Ser 


Thr 


Phe 


Lys 
365 


Cys 


Tyr 


Gly 


Val 


Ser 
370 


Ala 


Thr 


Lys 


Leu 


Asn 
375 


Asp 


Leu 


Cys 


Phe 


Ser 
380 


Asn 


Val 


Tyr 


Ala 


Asp 


Ser 


Phe 


Val 


Val 


Lys 


Gly 


Asp 


Asp 


Val 


Arg 


Gin 


He 


Ala 


Pro 


Gly 


385 










390 










395 










400 


Gin 


Thr 


Gly 


Val 


He 


Ala 


Asp 


Tyr 


Asn 


Tyr Lys 


Leu 


Pro 


Asp 


Asp 


Phe 










405 










410 










415 




Met 


Gly 


Cys 


Val 


Leu 


Ala 


Trp 


Asn 


Thr 


Arg Asn 


He 


Asp 


Ala 


Thr 


Ser 








420 










425 










430 






Thr 


Gly 


Asn 


Tyr 


Asn 


Tyr 


Lys 


Tyr 


Arg 


Tyr Leu 


Arg 


His 


Gly 


Lys 


Leu 






435 










440 










445 








Arg 


Pro 
450 


Phe 


Glu 


Arg 


Asp 


He 
455 


Ser 


Asn 


Val 


Pro 


Phe 
460 


Ser 


Pro 


Asp 


Gly 


Lys 


Pro 


Cys 


Thr 


Pro 


Pro 


Ala 


Leu 


Asn 


Cys 


Tyr 


Trp 


Pro 


Leu 


Asn 


Asp 


465 










470 










475 










480 


Tyr 


Gly 


Phe 


Tyr 


Thr 


Thr 


Thr 


Gly 


He 


Gly Tyr 


Gin 


Pro 


Tyr 


Arg 


Val 










485 










490 










495 




Val 


Val 


Leu 


Ser 


Phe 


Glu 


Leu 


Leu 


Asn 


Ala 


Pro 


Ala 


Thr 


Val 


Cys 


Gly 








500 










505 










510 


Pro 


Lys 


Leu 
515 


Ser 


Thr 


Asp 


Leu 


He 
52 0 


Lys 


Asn 


Gin 


Cys 


Val 
525 


Asn 


Phe 


Asn 


Phe 


Asn 
530 


Gly 


Leu 


Thr 


Gly 


Thr 
535 


Gly 


Val 


Leu 


Thr 


Pro 
540 


Ser 


Ser 


Lys 


Arg 


Phe 


Gin 


Pro 


Phe 


Gin 


Gin 


Phe 


Gly 


Arg 


Asp 


Val 


Ser 


Asp 


Phe 


Thr 


Asp 


545 










550 










555 










560 


Ser 


Val 


Arg 


Asp 


Pro 
565 


Lys 


Thr 


Ser 


Glu 


He 
570 


Leu 


Asp 


He 


Ser 


Pro 
575 


Cys 


Ala 


Phe 


Gly 


Gly 


Val 


Ser 


Val 


He 


Thr 


Pro 


Gly 


Thr 


Asn 


Ala 


Ser 


Ser 








580 










585 








590 






Glu 


Val 


Ala 
595 


Val 


Leu 


Tyr 


Gin 


Asp 
600 


Val 


Asn 


Cys 


Thr 


Asp 
605 


Val 


Ser 


Thr 


Ala 


He 


His 


Ala 


Asp 


Gin 


Leu 


Thr 


Pro 


Ala 


Trp 


Arg 


He 


Tyr 


Ser 


Thr 




610 










615 








620 








Gly 


Asn 


Asn 


Val 


Phe 


Gin 


Thr 


Gin 


Ala 


Gly Cys 


Leu 


He 


Gly 


Ala 


Glu 


625 










630 










635 










640 


His 


Val 


Asp 


Thr 


Ser 
645 


Tyr 


Glu 


Cys 


Asp 


He 
650 


Pro 


He 


Gly 


Ala 


Gly 
655 


He 


Cys 


Ala 


Ser 


Tyr 


His 


Thr 


Val 


Ser 


Leu 


Leu Arg 


Ser 


Thr 


Ser 


Gin 


Lys 








660 










665 










670- 






Ser 


He 


Val 


Ala 


Tyr 


Thr 


Met 


Ser 


Leu 


Gly Ala 


Asp 


Ser 


Ser 


He 


Ala 






675 










680 










685 








Tyr 


Ser 
690 


Asn 


Asn 


Thr 


He 


Ala 
695 


He 


Pro 


Thr 


Asn 


Phe 
700 


Ser 


He 


Ser 


He 
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Thr 


Thr 


Glu Val 


Met 


Pro 


Val 


Ser 


Met Ala 


Lys 


Thr 


Ser 


Val 


Asp Cys 


705 










710 








715 








720 


Asn 


Met 


Tyr 


He 


Cys 


Gly 


Asp 


Ser 


Thr Glu 


Cys 


Ala 


Asn 


Leu 


Leu Leu 










725 








730 








735 


Gin Tyr 


Gly 


Ser 


Phe 


Cys 


Thr 


Gin 


Leu Asn 


Arg 


Ala 


Leu 


Ser Gly He 








740 










745 








750 




Ala Ala 


Glu 


Gin 


Asp Arg 


Asn 


Thr 


Arg Glu 


Val 


Phe 


Ala 


Gin 


Val Lys 






755 










760 








765 




Gin 


Met 


Tyr Lys 


Thr 


Pro 


Thr 


Leu 


Lys Tyr 


Phe 


Gly 


Gly 


Phe 


Asn Phe 




770 










775 








780 








Ser 


Gin 


He 


Leu 


Pro 


Asp 


Pro 


Leu 


Lys Pro 


Thr 


Lys 


Arg 


Ser 


Phe He 


785 










790 






795 






800 


Glu 


Asp 


Leu 


Leu 


Phe 


Asn 


Lys 


Val 


Thr Leu 


Ala 


Asp 


Ala 


Gly Phe Met 










805 








810 










815 


Lys 


Gin 


Tyr 


Gly 
820 


Glu 


Cys 


Leu 


Gly 


Asp lie 
825 


Asn 


Ala 


Arg 


Asp 
830 


Leu He 


Cys 


Ala 


Gin 
835 


Lys 


Phe 


Asn 


Gly 


Leu 
840 


Thr Val 


Leu 


Pro 


Pro 
845 


Leu 


Leu Thr 


Asp 


Asp 


Met 


He 


Ala 


Ala 


Tyr 


Thr 


Ala Ala 


Leu 


Val 


Ser 


Gly Thr Ala 




850 










855 








860 








Thr 


Ala 


Gly Trp 


Thr 


Phe 


Gly 


Ala 


Gly Ala 


Ala 


Leu 


Gin 


He 


Pro Phe 


865 










870 








875 








880 


Ala 


Met 


Gin Met 


Ala 


Tyr 


Arg 


Phe 


Asn Gly 


He 


Gly 


Val 


Thr 


Gin Asn 










885 








890 










895 


Val 


Leu 


Tyr 


Glu 


Asn 


Gin 


Lys 


Gin 


He Ala 


Asn 


Gin 


Phe 


Asn 


Lys Ala 








900 










905 








910 


He 


Ser 


Gin 


He 


Gin 


Glu 


Ser 


Leu 


Thr Thr 


Thr 


Ser 


Thr 


Ala 


Leu Gly 






915 










920 








925 




Lys 


Leu 


Gin Asp 


Val 


Val 


Asn 


Gin 


Asn Ala 


Gin 


Ala 


Leu 


Asn 


Thr Leu 




930 










935 








940 








Val 


Lys 


Gin 


Leu 


Ser 


Ser 


Asn 


Phe 


Gly Ala 


He 


Ser 


Ser 


Val 


Leu Asn 


945 










950 








955 








960 


Asp 


He 


Leu 


Ser 


Arg 
965 


Leu 


Asp 


Lys 


Val Glu 
970 


Ala 


Glu 


Val 


Gin 


He Asp 
975 


Arg Leu 


He 


Thr 


Gly Arg 


Leu 


Gin 


Ser Leu 


Gin 


Thr 


Tyr 


Val 


Thr Gin 








980 










985 








990 




Gin 


Leu 


He 


Arg 


Ala 


Ala 


Glu 


He 


Arg Ala 


Ser 


Ala 


Asn 


Leu 


Ala Ala 






995 










1000 






1005 




Thr 


Lys 


Met 


Ser 


Glu 


Cys 


Val 


Leu 


Gly Gin 


Ser 


Lys 


Arg Val 


Asp Phe 




1010 








1015 






1020 




Cys 


Gly Lys 


Gly 


Tyr 


His 


Leu 


Met 


Ser Phe 


Pro 


Gin 


Ala 


Ala 


Pro His 


1025 








1030 






1035 






1040 


Gly Val 


Val 


Phe 


Leu 


His 


Val 


Thr 


Tyr Val 


Pro 


Ser 


Gin 


Glu Arg Asn 










1045 






1050 








1055 


Phe 


Thr 


Thr 


Ala 


Pro 


Ala 


He 


Cys 


His Glu 


Gly 


Lys 


Ala 


Tyr 


Phe Pro 








1060 








1065 








1070 


Arg Glu Gly Val 


Phe 


Val 


Phe 


Asn 


Gly Thr 


Ser 


Trp 


Phe 


He 


Thr Gin 






1075 








1080 




1085 




Arg Asn 


Phe 


Phe 


Ser 


Pro 


Gin 


He 


He Thr Thr Asp Asn Thr Phe Val 




1090 








1095 






1100 







Ser Gly Asn Cys Asp Val Val He Gly He He Asn Asn Thr Val Tyr 
1105 1110 1115 1120 

Asp Pro Leu Gin Pro Glu Leu Asp Ser Phe Lys Glu Glu Leu Asp Lys 

1125 1130 1135 

Tyr Phe Lys Asn His Thr Ser Pro Asp Val Asp Leu Gly Asp He Ser 

1140 1145 1150 

Gly He Asn Ala Ser Val Val Asn He Gin Lys Glu He Asp Arg Leu 

1155 1160 1165 

Asn Glu Val Ala Lys Asn Leu Asn Glu Ser Leu He Asp Leu Gin Glu 

1170 1175 1180 

Leu Gly Lys Tyr Glu Gin Tyr He Lys Trp Pro Trp Tyr Val Trp Leu 
1185 1190 1195 1200 
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Gly Phe lie Ala Gly Leu lie Ala lie Val Met Val Thr lie Leu Leu 

1205 1210 1215 

Cys Cys Met Thr Ser Cys Cys Ser Cys Leu Lys Gly Ala Cys Ser Cys 

1220 1225 1230 

Gly Ser Cys Cys Lys Phe Asp Glu Asp Asp Ser Glu Pro Val Leu Lys 

1235 1240 1245 

Gly Val Lys Leu His Tyr Thr 
1250 1255 



<210> 2 
<211> 57 
<212> PRT 
<213> SRS-CoV 



<400> 2 

Pro Asp Val Asp Leu Gly Asp lie 

1 5 
Asn He Gin Lys Glu lie Asp Arg 
20 

Asn Glu Ser Leu He Asp Leu Gin 

35 40 
He Lys Trp Pro Trp Tyr Val Trp 
5 0 55 



Ser Gly He Asn Ala Ser Val Val 

10 15 
Leu Asn Glu Val Ala Lys Asn Leu 
25 30 
Glu Leu Gly Lys Tyr Glu Gin Tyr 
45 

Leu 



<210> 3 
<211> 35 
<212> PRT 
<213> SARS-CoV 

<400> 3 

Leu Gly Asp He Ser Gly He Asn Ala Ser Val Val Asn He Gin Lys 

15 10 15 

Glu He Asp Arg Leu Asn Glu Val Ala Lys Asn Leu Asn Glu Ser Leu 
20 25 30 

He Asp Leu 
35 



<210> 4 
<211> 35 
<212> PRT 
<213> SARS-COV 

<400> 4 

He Ser Gly He Asn Ala Ser Val Val Asn He Gin Lys Glu He Asp 

15 10 15 

Arg Leu Asn Glu Val Ala Lys Asn Leu Asn Glu Ser Leu He Asp Leu 
20 25 30 

Gin Glu Leu 
35 



<210> 5 
<211> 33 
<212> PRT 
<213> Yeast 

<400> 5 

Arg Met Lys Gin Leu Glu Asp Lys Val Glu Glu Leu Leu Ser Lys Asn 
15 10 15 
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Tyr His Leu Glu Asn Glu Val Ala Arg Leu Lys Lys Leu Val Gly Glu 
20 25 30 

Arg 



<210> 6 
<211> 36 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Chimeric sequence of SARS-CoV sequence and yeast 
sequence 

<221> ACETYLATION 
<222> 1 

<221> AMI DAT I ON 
<222> 36 

<400> 6 

Cys Gly Gly lie Met Lys He Asn Glu Asp Val Val Glu He Gin Leu 

15 10 15 

Ser He Asn Tyr Leu Asn Glu Asn Ala Val Ala Leu Asn Lys Lys Leu 

20 < 25 30 

Val Gly Leu Gin 
35 



<210> 7 
<211> 36 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Modified yeast sequence 

<221> ACETYLATION 
<222> 1 

<400> 7 

Cys Gly Gly Arg Met Lys Gin Leu Glu Asp Lys Val Glu Glu Leu Leu 

1 5 10 15 

Ser Lys Asn Tyr His Leu Glu Asn Glu Val Ala Arg Leu Lys Lys Leu 

20 25 30 

Val Gly Glu Arg 
35 



<210> 8 
<211> 39 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Modified SARS-CoV sequence 
<400> 8 

Cys Cys Gly Thr Thr Thr Ser Thr Ala Leu Gly Lys Leu Gin Asp Val 
15 10 15 
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Val Asn Gin Asn Ala Gin Ala Leu Asn Thr Leu Val Lys Gin Leu Ser 

20 25 30 

Ser Asn Phe Gly Ala lie Ser 
35 



<210> 9 
<211> 32 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Modified SARS-CoV sequence 
<400> 9 

Cys Cys Gly Glu Asn Gin Lys Gin He Ala Asn Gin Phe Asn Lys Ala 

15 10 15 

He Ser Gin He Gin Glu Ser Leu Thr Thr Thr Ser Thr Ala Leu Gly 
20 25 30 



<210> 10 
<211> 41 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Modified SARS-CoV sequence 
<400> 10 

Cys Cys Gly Ala Leu Asn Thr Leu Val Lys Gin Leu Ser Ser Asn Phe 

15 10 15 

Gly Ala He Ser Ser Val Leu Asn Asp He Leu Ser Arg Leu Asp Lys 

20 25 30 

Val Glu Ala Glu Val Gin He Asp Arg 
35 40 



<210> 11 
<211> 33 
<212> PRT 
<213> Yeast 



<400> 11 

Arg Met Lys Gin He Glu Asp Lys 

1 5 
Tyr His He Glu Asn Glu He Ala 
20 

Arg 



He Glu Glu He Leu Ser Lys Gin 

10 15 
Arg He Lys Lys Leu He Gly Glu 
25 30 



<210> 12 
<211> 34 
<212> PRT 
<213> Yeast 



<400> 12 

Arg Met Lys Gin He Glu Asp Lys 

1 5 
Lys Lys He Glu Asn Glu He Ala 
20 



He Glu Glu He Glu Ser Lys Gin 

10 15 
Arg He Lys Lys Leu lie Gly Glu 
25 30 
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Arg Tyr 



<210> 13 
<211> 15 
<212> PRT 
<213> Yeast 

<400> 13 

Lys lie Lys Lys He Glu Asn Glu He Ala Arg He Lys Lys Leu 
1 5 10 15 



<210> 14 
<211> 31 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> derivative of Leucine zipper sequence 

<221> ACETYLATION 
<222> 1 

<221> AMTDATION 
<222> 31 

<400> 14 

Tyr Gly Gly He Glu Lys Lys He Glu Ala He Glu Lys Lys He Glu 

1 5 10 15 

Ala He Glu Lys Lys He Glu Ala He Glu Lys Lys He Glu Ala 
20 25 30 



<210> 15 
<211> 31 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> derivative of isoleucine zipper sequence 

<221> ACETYLATION 
<222> 1 

<221> AMIDATION 
<222> 31 

<400> 15 

Tyr Gly Gly He Lys Lys Glu He Glu Ala He Lys Lys Glu Gin Glu 

1 5 10 15 

Ala lie Lys Lys Lys He Glu Ala He Glu Lys Glu He Glu Ala 
20 25 30 



<210> 16 
<211> 64 
<212> PRT 

<213> Artificial Sequence 



<220> 
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<223> Chimeric sequence including SARS-CoV sequence 
<400> 16 

Thr lie Lys Lys Glu He Glu Ala He Lys Lys Glu Gin Glu Ala He 

15 10 15 

Lys Lys Lys He Glu Ala He Glu Lys Glu He Thr Thr Thr Ser Thr 

20 25 30 

Ala Leu Gly Lys Leu Gin Asp Val Val Asn Gin Asn Ala Gin Ala Leu 

35 40 45 

Asn Thr Leu Val Lys Gin Leu Ser Ser Asn Phe Gly Ala He Ser Gly 
50 55 60 



<210> 17 
<211> 68 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Chimeric sequence including SARS-CoV sequence 
<400> 17 

Thr Arg Met Lys Gin He Glu Asp Lys He Glu Glu He Glu Ser Lys 

15 10 15 

Gin Lys Lys He Glu Asn Glu He Ala Arg He Lys Lys Leu He Thr 

20 25 30 

Thr Thr Ser Thr Ala Leu Gly Lys Leu Gin Asp Val Val Asn Gin Asn 

35 40 45 

Ala Gin Ala Leu Asn Thr Leu Val Lys Gin Leu Ser Ser Asn Phe Gly 

50 55 60 

Ala He Ser Gly 
65 



<210> 18 






<211> 66 






<212> PRT 






<213> Artificial Sequence 




<220> 






<223> chimeric sequence 






<400> 18 






Arg Met Lys Gin He Glu 


Asp 


Lys 


1 5 


Lys Lys He Glu Asn Glu 


He 


Ala 


20 






He Gin Glu Ser Leu Thr 


Thr 


Thr 


35 




40 


Asp Val Val Asn Gin Asn 


Ala 


Gin 


50 


55 




Leu Ser 







He Glu Glu He Glu Ser Lys Gin 

10 15 
Arg He Lys Lys Leu He Ser Gin 
25 30 
Ser Thr Ala Leu Gly Lys Leu Gin 
45 

Ala Leu Asn Thr Leu Val Lys Gin 
60 



<210> 19 

<211> 59 

<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Chimeric sequence including SARS-CoV sequence 
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<400> 19 

Arg Met Lys Gin lie Glu Asp Lys He Glu Glu He Glu Ser Lys Gin 

15 10 15 

Lys Lys He Glu Asn Glu He Ala Arg He Lys Lys Leu He Glu Asn 

20 25 30 

Gin Lys Gin He Ala Asn Gin Phe Asn Lys Ala He Ser Gin He Gin 

35 40 45 

Glu Ser Leu Thr Thr Thr Ser Thr Ala Leu Gly 
50 55 



<210> 20 
<211> 59 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Chimeric sequence containing SARS-CoV sequence 



<400> 20 






















Thr Arg Met 


Lys 


Gin 


He 


Glu 


Asp Lys He Glu 


Glu 


He 


Glu 


Ser 


Lys 


1 




5 






10 








15 




Gin Lys Lys 


He 


Glu 


Asn 


Glu 


He Ala Arg He 


Lys 


Lys 


Leu 


Glu 


Asn 




20 








25 






30 






Gin Lys Gin 


He 


Ala 


Asn 


Gin 


Phe Asn Lys Ala 


He 


Ser 


Gin 


He 


Gin 


35 










40 




45 








Glu Ser Leu 


Thr 


Thr 


Thr 


Ser 


Thr Ala Leu Gly 












50 








55 













<210> 21 
<211> 56 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Chimeric sequence containing SARS-CoV sequence 
<400> 21 

Thr He Lys Lys Glu He Glu Ala He Lys Lys Glu Gin Glu Ala He 

15 10 15 

Lys Lys Lys He Glu Ala He Glu Lys Glu lie Glu Asn Gin Lys Gin 

20 25 30 

lie Ala Asn Gin Phe Asn Lys Ala He Ser Gin He Gin Glu Ser Leu 

35 40 45 

Thr Thr Thr Ser Thr Ala Leu Gly 
50 55 



<210> 22 
<211> 49 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Chimeric sequence containing SARS-CoV sequence 
<400> 22 

Thr He Lys Lys Glu lie Glu Ala He Lys Lys Glu Gin Glu Ala lie 
15 10 15 
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Lys Lys Lys lie Glu Ala lie Glu Lys Arg Leu Gin Ser Leu Gin Thr 

20 25 30 

Tyr Val Thr Gin Gin Leu lie Arg Ala Ala Glu lie Arg Ala Ser Ala 
35 40 45 

Asn 



<210> 23 
<211> 53 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Chimeric sequence containing SARS-CoV sequence 
<400> 23 

Thr Arg Met Lys Gin He Glu Asp Lys He Glu Glu He Glu Ser Lys 

15 10 15 

Gin Lys Lys He Glu Asn Glu He Ala Arg He Lys Lys Arg Leu Gin 

20 25 30 

Ser Leu Gin Thr Tyr Val Thr Gin Gin Leu He Arg Ala Ala Glu He 

35 40 45 

Arg Ala Ser Ala Asn 
50 



<210> 24 
<211> 67 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Chimeric sequence containing SARS-CoV sequence 
<400> 24 

Thr Arg Met Lys Gin He Glu Asp Lys He Glu Glu He Glu Ser Lys 

15 10 15 

Gin Lys Lys lie Glu Asn Glu He Ala Arg He Lys Lys Ala Leu Asn 

20 25 30 

Thr Leu Val Lys Gin Leu Ser Ser Asn Phe Gly Ala He Ser Ser Val 

35 40 45 

Leu Asn Asp He Leu Ser Arg Leu Asp Lys Val Glu Ala Glu Val Gin 

50 55 60 

He Asp Arg 
65 



<210> 25 
<211> 63 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Chimeric sequence containing SARS-CoV sequence 
<400> 25 

Thr He Lys Lys Glu He Glu Ala He Lys Lys Glu Gin Glu Ala He 

1 5 10 15 

Lys Lys Lys He Glu Ala He Glu Lys Ala Leu Asn Thr Leu Val Lys 
20 25 30 
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Gin Leu Ser Ser Asn Phe Gly Ala lie Ser Ser Val Leu Asn Asp He 

35 40 45 

Leu Ser Arg Leu Asp Lys Val Glu Ala Glu Val Gin He Asp Arg 
50 55 60 



<210> 26 
<211> 62 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> chimeric sequence 
<400> 26 

He Lys Lys Glu He Glu Ala He Lys Lys Glu Gin Glu Ala He Lys 

1 5 io 15 

Lys Lys He Glu Ala He Glu Lys Glu He Ser Gin He Gin Glu Ser 

20 25 30 

Leu Thr Thr Thr Ser Thr Ala Leu Gly Lys Leu Gin Asp Val Val Asn 

35 40 45 

Gin Asn Ala Gin Ala Leu Asn Thr Leu Val Lys Gin Leu Ser 
50 55 60 



<210> 27 
<211> 58 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> chimeric sequence 
<400> 27 

lie Lys Lys Glu He Glu Ala He Lys Lys Glu Gin Glu Ala He Lys 

1 5 io 15 

Lys Lys He Glu Ala He Asn Val Leu Tyr Glu Asn Gin Lys Gin He 

20 25 30 

Ala Asn Gin Phe Asn Lys Ala He Ser Gin He Gin Glu Ser Leu Thr 

35 40 45 

Thr Thr Ser Thr Ala Leu Gly Lys Leu Gin 
50 55 



<210> 28 
<211> 62 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Chimeric sequence containing SARS-CoV sequence 
<400> 28 

lie Lys Lys Glu He Glu Ala He Lys Lys Glu Gin Glu Ala He Lys 

15 10 15 

Lys Lys lie Glu Ala He Glu Lys Glu He Glu Asn Gin Lys Gin lie 

20 25 30 

Ala Asn Gin Phe Asn Lys Ala lie Ser Gin lie Gin Glu Ser Leu Thr 

35 40 45 

Thr Thr Ser Thr Ala Leu Gly Lys Leu Gin Asp Val Val Asn 
50 55 60 
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<210> 29 
<211> 62 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Chimeric sequence containing SARS-CoV sequence 
<400> 29 

lie Lys Lys Glu He Glu Ala He Lys Lys Glu Gin Glu Ala He Lys 

15 10 15 

Lys Lys He Glu Ala He Glu Lys Glu He Asn Gin Asn Ala Gin Ala 

20 25 30 

Leu Asn Thr Leu Val Lys Gin Leu Ser Ser Asn Phe Gly Ala He Ser 

35 40 45 

Ser Val Leu Asn Asp He Leu Ser Arg Leu Asp Lys Val Glu 
50 55 60 



<210> 30 
<211> 41 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Chimeric sequence containing SARS-CoV sequence 
<400> 30 

He Lys Lys Glu He Glu Ala He Lys Lys Glu Gin Glu Ala He Lys 

15 10 15 

Lys Lys He Glu Ala He Glu Lys Ser Leu Thr Thr Thr Ser Thr Ala 

20 25 30 

Leu Gly Lys Leu Gin Asp Val Val Asn 
35 40 



<210> 31 
<211> 41 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Chimeric sequence containing SARS-CoV sequence 
<400> 31 

He Lys Lys Glu He Glu Ala He Lys Lys Glu Gin Glu Ala He Lys 

15 10 15 

Lys Lys He Glu Ala lie Glu Lys Ala He Ser Ser Val Leu Asn Asp 

20 25 30 

He Leu Ser Arg Leu Asp Lys Val Glu 
35 40 



<210> 32 
<211> 64 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> chimeric sequence 
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<400> 32 

Ser Gin lie Gin Glu Ser Leu Thr 

1 5 
Leu Gin Asp Val Val Asn Gin Asn 
20 

Lys Gin Leu Ser Ser lie Lys Lys 

35 40 
Gin Glu Ala lie Lys Lys Lys He 
50 55 



Thr Thr Ser Thr Ala Leu Gly Lys 

10 15 
Ala Gin Ala Leu Asn Thr Leu Val 
25 30 
Glu He Glu Ala He Lys Lys Glu 
45 

Glu Ala He Glu Lys Glu He Gly 
60 



<210> 33 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> designed linker sequence 
<400> 33 

Gly Gly Gly Ser Ser Gly Gly Gly Ser Gly Gly 
1 5 10 



<210> 34 
<211> 290 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Chimeric sequence containing SARS-CoV sequence 
<400> 34 

Met Val Leu Tyr Glu Asn Gin Lys Gin He Ala Asn Gin Phe Asn Lys 

15 10 15 

Ala He Ser Gin He Gin Glu Ser Leu Thr Thr Thr Ser Thr Ala Leu 

20 25 30 

Gly Lys Leu Gin Asp Val Val Asn Gin Asn Ala Gly Gly Gly Ser Ser 

35 40 45 

Gly Gly Gly Ser Gly Gly Asp He Ser Gly He Asn Ala Ser Val Val 

50 55 60 

Asn He Gin Lys Glu He Asp Arg Leu Asn Glu Val Ala Lys Asn Leu 
65 70 75 80 

Asn Glu Ser Leu He Asp Leu Gin Glu Leu Gly Gly Gly Ser Ser Gly 

85 90 ^ 95 

Gly Gly Ser Gly Gly Val Leu Tyr Glu Asn Gin Lys Gin He Ala Asn 

100 105 HO 

Gin Phe Asn Lys Ala He Ser Gin He Gin Glu Ser Leu Thr Thr Thr 

115 120 125 

Ser Thr Ala Leu Gly Lys Leu Gin Asp Val Val Asn Gin Asn Ala Gly 

130 135 140 

Gly Gly Ser Ser Gly Gly Gly Ser Gly Gly Asp He Ser Gly lie Asn 
145 150 155 160 

Ala Ser Val Val Asn He Gin Lys Glu He Asp Arg Leu Asn Glu Val 

165 170 " 175 

Ala Lys Asn Leu Asn Glu Ser Leu lie Asp Leu Gin Glu Leu Gly Gly 

180 185 190 

Gly Ser Ser Gly Gly Gly Ser Gly Gly Val Leu Tyr Glu Asn Gin Lys 

195 200 205 

Gin lie Ala Asn Gin Phe Asn Lys Ala lie Ser Gin He Gin Glu Ser 
210 215 220 
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Leu Thr Thr 


Thr 


Ser 


Thr 


Ala 


Leu 


225 






230 






Gin Asn Ala 


Gly Gly 


Gly 


Ser 


Ser 






245 








Ser Gly lie 


Asn 


Ala 


Ser 


Val 


Val 




260 










Leu Asn Glu 


Val 


Ala 


Lys 


Asn 


Leu 


275 








280 


Glu Leu 













290 



Gly Lys Leu Gin Asp Val Val Asn 
235 240 
Gly Gly Gly Ser Gly Gly Asp He 

250 255 
Asn He Gin Lys Glu He Asp Arg 
265 270 
Asn Glu Ser Leu He Asp Leu Gin 
285 



<210> 35 
<211> 284 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Chimeric sequence containing SARS-CoV sequence 
<400> 35 

Met Thr Ala Leu Gly Lys Leu Gin Asp Val Val Asn Gin Asn Ala Gin 

1 5 10 15 

Ala Leu Asn Thr Leu Val Lys Gin Leu Ser Ser Asn Phe Gly Ala He 

20 25 30 

Ser Ser Val Leu Asn Asp He Leu Ser Gly Gly Gly Ser Ser Gly Gly 

35 40 45 

Gly Ser Gly Gly Asp He Ser Gly He Asn Ala Ser Val Val Asn He 

50 55 60 

Gin Lys Glu He Asp Arg Leu Asn Glu Val Ala Lys Asn Leu Asn Glu 
65 70 75 80 

Ser Leu He Asp Leu Gin Glu Leu Gly Gly Gly Ser Ser Gly Gly Gly 

85 90 95 

Ser Gly Gly Thr Ala Leu Gly Lys Leu Gin Asp Val Val Asn Gin Asn 

100 105 ~ no 

Ala Gin Ala Leu Asn Thr Leu Val Lys Gin Leu Ser Ser Asn Phe Gly 

115 120 125 

Ala He Ser Ser Val Leu Asn Asp He Leu Ser Gly Gly Gly Ser Ser 

130 135 140 

Gly Gly Gly Ser Gly Gly Asp He Ser Gly He Asn Ala Ser Val Val 
145 , 150 155 160 

Asn He Gin Lys Glu He Asp Arg Leu Asn Glu Val Ala Lys Asn Leu 

165 170 175 

Asn Glu Ser Leu lie Asp Leu Gin Glu Leu Gly Gly Gly Ser Ser Gly 

180 185 ~ 190 

Gly Gly Ser Gly Gly Thr Ala Leu Gly Lys Leu Gin Asp Val Val Asn 

195 200 205 

Gin Asn Ala Gin Ala Leu Asn Thr Leu Val Lys Gin Leu Ser Ser Asn 

210 215 220 

Phe Gly Ala He Ser Ser Val Leu Asn Asp He Leu Ser Gly Gly Gly 
22 5 230 235 240 

Ser Ser Gly Gly Gly Ser Gly Gly Asp He Ser Gly He Asn Ala Ser 

245 250 255 

Val Val Asn He Gin Lys Glu He Asp Arg Leu Asn Glu Val Ala Lys 

260 265 270 

Asn Leu Asn Glu Ser Leu He Asp Leu Gin Glu Leu 
275 280 



<210> 36 
<211> 284 
<212> PRT 
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<213> Artificial Sequence 
<220> 

<223> Chimeric sequence containing SARS-CoV sequence 



<400> 36 



Met 


Gin 


He 


Asp 


Arg 


Leu 


He 


Thr Gly Arg 


Leu 


Gin 


Ser 


Leu 


Gin 


Thr 


1 








5 








10 










15 




Tyr 


Val 


Thr 


Gin 


Gin 


Leu 


He 


Arg Ala Ala 


Glu 


He 


Arg 


Ala 


Ser 


Ala 






20 










25 








30 






Asn 


Leu 


Ala 


Ala 


Thr 


Lys 


Met 


Ser Glu Gly 


Gly 


Gly 


Ser 


Ser 


Gly 


Gly 






35 










40 








45 








Gly 


Ser 


Gly 


Gly 


Asp 


He 


Ser 


Gly He Asn 


Ala 


Ser 


Val 


Val 


Asn 


He 




50 










55 








60 










Gin 


Lys 


Glu 


He 


Asp 


Arg 


Leu 


Asn 


Glu Val 


Ala 


Lys 


Asn 


Leu 


Asn 


Glu 


65 










70 








75 










80 


Ser 


Leu 


He 


Asp 


Leu 
85 


Gin 


Glu 


Leu 


Gly Gly 
90 


Gly 


Ser 


Ser 


Gly 


Gly 
95 


Gly 


Ser 


Gly 


Gly 


Gin 
100 


He 


Asp 


Arg 


Leu 


He Thr 
105 


Gly 


Arg 


Leu 


Gin 
110 


Ser 


Leu 


Gin 


Thr 


Tyr 
115 


Val 


Thr 


Gin 


Gin 


Leu 
120 


He Arg 


Ala 


Ala 


Glu 
125 


He 


Arg 


Ala 


Ser 


Ala 
130 


Asn 


Leu 


Ala 


Ala 


Thr 
135 


Lys 


Met Ser 


Glu 


Gly 
140 


Gly 


Gly 


Ser 


Ser 


Gly 


Gly 


Gly 


Ser 


Gly 


Gly 


Asp 


He 


Ser Gly 


He 


Asn 


Ala 


Ser 


Val 


Val 


145 










150 








155 










160 


Asn 


lie 


Gin 


Lys 


Glu 
165. 


He 


Asp 


Arg 


Leu Asn 
170 


Glu 


Val 


Ala 


Lys 


Asn 
175 


Leu 


Asn 


Glu 


Ser 


Leu 
180 


He 


Asp 


Leu 


Gin 


Glu Leu 
185 


Gly 


Gly 


Gly 


Ser 
190 


Ser 


Gly 


Gly 


Gly 


Ser 
195 


Gly 


Gly 


Gin 


He 


Asp 
200 


Arg Leu 


He 


Thr 


Gly 
205 


Arg 


Leu 


Gin 


Ser 


Leu 


Gin 


Thr 


Tyr 


Val 


Thr 


Gin 


Gin Leu 


He 


Arg 


Ala 


Ala 


Glu 


He 




210 








215 








220 










Arg 


Ala 


Ser 


Ala 


Asn 


Leu 


Ala 


Ala 


Thr Lys 


Met 


Ser 


Glu 


Gly 


Gly 


Gly 


225 










230 








235 










240 


Ser 


Ser 


Gly 


Gly 


Gly 


Ser 


Gly 


Gly Asp He 


Ser 


Gly 


He 


Asn 


Ala 


Ser 










245 








250 










255 




Val 


Val 


Asn 


He 
260 


Gin 


Lys 


Glu 


He 


Asp Arg 
265 


Leu 


Asn 


Glu 


Val 
270 


Ala 


Lys 


Asn 


Leu 


Asn 
275 


Glu 


Ser 


Leu 


He 


Asp 
280 


Leu Gin 


Glu 


Leu 











<210> 37 
<211> 374 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> chimeric sequence 
<400> 37 

Met Gly He Gly Val Thr Gin Asn 

1 5 
He Ala Asn Gin Phe Asn Lys Ala 
20 

Thr Thr Thr Ser Thr Ala Leu Gly 

35 40 
Asn Ala Gin Ala Leu Asn Thr Leu 
50 55 



Val Leu 


Tyr 


Glu 


Asn 


Gin 


Lys 


Gin 


10 










15 




He Ser 


Gin 


He 


Gin 


Glu 


Ser 


Leu 


25 








30 






Lys Leu 


Gin 


Asp 


Val 
45 


Val 


Asn 


Gin 


Val Lys 


Gin 


Leu 
60 


Ser 


Gly 


Gly 


Gly 
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Ser Ser Gly Gly Gly Ser Gly Gly Leu Gly Asp He Ser Gly He Asn 
65 70 75 " 80 

Ala Ser Val Val Asn He Gin Lys Glu He Asp Arg Leu Asn Glu Val 

85 90 95 

Ala Lys Asn Leu Asn Glu Ser Leu He Asp Leu Gin Glu Leu Gly Lys 

100 105 110 

Tyr Glu Gin Tyr He Lys Gly Gly Gly Ser Ser Gly Gly Gly Ser Gly 

115 120 125 

Gly Gly He Gly Val Thr Gin Asn Val Leu Tyr Glu Asn Gin Lys Gin 

130 135 140 

He Ala Asn Gin Phe Asn Lys Ala He Ser Gin He Gin Glu Ser Leu 
145 150 155 160 

Thr Thr Thr Ser Thr Ala Leu Gly Lys Leu Gin Asp Val Val Asn Gin 

165 170 175 

Asn Ala Gin Ala Leu Asn Thr Leu Val Lys Gin Leu Ser Gly Gly Gly 

180 185 190 

Ser Ser Gly Gly Gly Ser Gly Gly Leu Gly Asp He Ser Gly He Asn 

195 200 205 

Ala Ser Val Val Asn He Gin Lys Glu He Asp Arg Leu Asn Glu Val 

210 215 220 

Ala Lys Asn Leu Asn Glu Ser Leu He Asp Leu Gin Glu Leu Gly Lys 
225 230 235 240 

Tyr Glu Gin Tyr He Lys Gly Gly Gly Ser Ser Gly Gly Gly Ser Gly 

245 250 255 

Gly Gly He Gly Val Thr Gin Asn Val Leu Tyr Glu Asn Gin Lys Gin 

260 265 270 

He Ala Asn Gin Phe Asn Lys Ala He Ser Gin He Gin Glu Ser Leu 

275 280 285 

Thr Thr Thr Ser Thr Ala Leu Gly Lys Leu Gin Asp Val Val Asn Gin 

290 295 300 

Asn Ala Gin Ala Leu Asn Thr Leu Val Lys Gin Leu Ser Gly Gly Gly 
305 310 315 320 

Ser Ser Gly Gly Gly Ser Gly Gly Leu Gly Asp He Ser Gly He Asn 

325 330 335 

Ala Ser Val Val Asn He Gin Lys Glu He Asp Arg Leu Asn Glu Val 

340 345 350 

Ala Lys Asn Leu Asn Glu Ser Leu He Asp Leu Gin Glu Leu Gly Lys 

355 360 365 

Tyr Glu Gin Tyr He Lys 
370 



<210> 38 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> enhancer region containg tryptophan and / or a 
tryptophan analog. 

<221> M0D_RES 
<222> 1, 4 

<223> Xaa = independently either tryptophan or a 
tryptophan analog. 

<400> 38 

Xaa Gin Glu Xaa Glu Gin Lys 
1 5 



<210> 39 
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<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> enhancer region containg tryptophan and / or a 
tryptophan analog. 

<221> MOD_RES 
<222> 1, 3 

<223> Xaa = independently either tryptophan or a 
tryptophan analog 

<400> 39 

Xaa Pro Xaa Tyr Val Xaa Leu 
1 5 



<210> 40 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> enhancer region containg tryptophan. 
<400> 40 

Trp Gin Glu Trp Glu Gin Lys lie 
1 5 



<210> 41 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> enhancer region containg tryptophan. 
<400> 41 

Trp Pro Trp Tyr Val Trp Leu 
1 5 



<210> 42 
<211> 42 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> SARS-CoV sequence containg tryptophan and / or a 
tryptophan analog. 

<221> MOD_RES 
<222> 36, 38 

<223> Xaa = independently either tryptophan or a 
tryptophan analog 

<400> 42 

Leu Gly Asp lie Ser Gly He Asn Ala Ser Val Val Asn He Gin Lys 
15 10 15 
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Glu He Asp Arg Leu Asn Glu Val Ala Lys Asn Leu Asn Glu Ser Leu 

20 25 3 ° 

He Asp Leu Xaa Pro Xaa Tyr Val Xaa Leu 
35 40 



<210> 43 
<211> 35 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> derivative of SARS-CoV sequence 



<400> 43 „ - n 

Pro Asp Val Asp Lys Gly Asp He Ser Gly He Asn Ala Ser Val Val 

15 10 15 

Asn He Gin Lys Glu He Asp Arg Leu Asn Glu Val Ala Lys Asn Leu 
20 25 30 

Asn Glu Ser 
35 



<210> 44 
<211> 35 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> derivative of SARS-CoV sequence 

<221> M0D_RES 

<222> 10 

<223> Xaa = Dbu 

<r400> 44 

Asp Val Asp Leu Gly Glu He Ser Gly Xaa Asn Ala Ser Val Val Asn 

15 10 15 

He Gin Lys Glu He Asp Arg Leu Asn Glu Val Ala Lys Asn Leu Asn 
20 25 30 

Glu Ser Leu 
35 



<210> 45 
<211> 35 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> derivative of SARS-CoV sequence 

<221> M0D_RES 
<222> 12 
<223> Xaa = Orn 

<400> 45 

Val Asp Leu Gly Asp 

1 5 
Gin Lys Glu He Asp 
20 



He Ser Gly He Asn Ala Xaa Val Val Asn He 

10 15 
Arg Leu Asn Glu Val Ala Lys Asn Leu Asn Glu 
25 30 
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Ser Leu lie 
35 
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